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POLYPEPTIDES OF G- COUPLED RECEPTOR PROTEINS , 
AND COMPOSITIONS AND METHODS THEREOF 

FIELD OF THE INVENTION 

The present invention relates to compounds, 
compositions and methods involving synthetic, isolated and/or 
recombinant G-protein coupled receptor polypeptides that 
comprise fragments and/or consensus peptides of G-protein 
coupled receptors . 

BACKGROUND OF THE INVENTION 

The membrane protein gene superfamily of G-protein 
coupled receptors (GPRs) has been characterized as having seven 
putative transmembrane domains. The domains are believed to 
represent transmembrane a- helices connected by extracellular or 
cytoplasmic loops . Of the 74 sequenced members of this 
G-protein receptor superfamily, the shortest sequence of 324 
amino acids represents the rat mas oncogene and the longest, of 
744 amino acids, represents the human thyroid- stimulating 
hormone (TSH) receptor. GPRs thus include a wide range of 
biologically active receptors, such as hormone-, viral-, growth 
factor- and neuroreceptors. 

G-protein coupled receptors have been characterized as 
including these seven conserved hydrophobic stretches of about 
20-30 amino acids, connecting at least 8 divergent hydfophilic 
loops. The G-protein family of coupled receptors includes 
dopamine receptors which bind in a noncovalent but high affinity 
manner to neuroleptic drugs used for treating psychotic and 
neurological disorders. For example, the dopamine D : receptor 
includes these transmembrane domains, two of which (TM III and 
TM V; see below) have been implicated by site- selective 
mutagenesis to demonstrate functional, association with D : 
liga nds . 

Transmembrane domains of G-protein coupled receptors 
are designated TM1, TM2 , TM3 , TM4 , TM5 , TM6 and TM7 . TM4 , TM5 , 
TM6 and TM7 are the most highly conserved and are postulated to 
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provide sequences which impart biological activity to GPRs. 
Most GPRs have single conserved cysteine residues in each of the 
first two extracellular loops which form disulfide bonds that 
are believed to stabilize functional protein structure*. TM3,is 
5 also implicated in signal transduction. 

Phosphorylation and lipidation (palmitylation or 
f arnesylation) of cysteine residues can influence signal 
transduction of some GPRs. Most GPRs contain potential 
phosphorylation sites (e.g., serine or theronine residues) 

10 within the third cytoplasmic loop and/or the carboxy terminus. 

For several GPRs, such as the 0-adrenoreceptor , phosphorylation 
by protein kinase A and/or specific receptor kinases mediates 
receptor desensitizat ion . 

Non- limiting examples of GPRs include cAMP receptors, 

15 adenosine receptors, ^-adrenergic receptors, muscarinic 

acetylcholine receptors, a- adrenergic receptors, serotonin 
receptors (5-HT) , histamine H2 receptors, thrombin receptors, 
kinin receptors, follicle stimulating hormone receptors, opsins 
and rhodopsins, odorant receptors, cytomegalovirus receptor, 

20 etc. See e.g., Probst et al DNA and Cell Biology 11:1-20(1992), 

which is entirely incorporated herein by reference. 

The ligand binding sites of GPRs are believed to 
comprise a hydrophilic socket formed by several GPR 
transmembrane domains, which socket is surrounded by hydrophobic 

25 residues of the GPRs. The hydrophilic side of each GPR 

transmembrane helix is postulated to face inward and form the 
polar ligand binding site. TM3 has been implicated in several 
GPRs as having a ligand binding site, such as including the TM3 
aspartate residue. Additionally, TM5 serines, a TM6 asparagine 

30 and TM6 or TM7 phenylalanines or tyrosines are also implicated 

in ligand binding. 

GPRs can be intracellularly coupled by heterotrimeric 
G-proteins to various intracellular enzymes, ion channels and 
transporters. See, e.g., Johnson et al Endoc . JRev. 

35 10:317-331(1989) ; and Birnbaumer et al Biochem. Biophys. Acta 

1031:163-224(1990) which references are incorporated entirely 
herein by reference. GPR agonist binding catalyzes the exchange 



WO 94/05695 



o 



PCT/US93/0852X 



- 3 - 

of GTP for GDP on the a-subunit of the G-protein. Different: 
G-protein a-subunits preferentially stimulate particular 
effectors to modulate various biological functions in a cell. 
Phosphorylation of cytoplasmic residues of GPRs has been 
identified as an important mechanism for the regulation of 
G-protein coupling of some GPRs. 

As a non-limiting example of a GPR ligand, dopamine 
( 3 , 4 -dihydroxyphenethylamine) is a critical neurotransmitter in 
the central nervous system (e.g., in the substantia nigra, 
midbrain, and hypothalamus) . Since the elucidation of the 
ascending mesolimbic and nigrostriatal pathways, these pathways 
have been found to be critical in the control of both motor 
initiation (nigrostriatal) behavior and affective (mesolimbic) 
behavior. The clinical efficacy of the major neuroleptic 
antipsychotic medications has been found to correlate with the 
respective affinities of these agents for the dopamine D 2 
receptor in the brain. A dopaminergic role in the 
symptomatology of the major psychoses has thus been 
hypothesized, although it is unclear if dopamine alone is 
etiological, (see, e.g., Davis et al . Am. J. Psych. 148:1474- 
1476 (1991)). Nonetheless, this hypothesis has served as a 
stimulus for current research in this area. 

One model for studying possible interactions of 
G-protein coupled receptors with their ligands has emerged from 
site -directed mutagenesis and biochemical analysis of the 
/3-adrenergic receptor, as well as from biophysical analysis of 
the interaction of retinal with opsin. 

According to such a model, the binding of a GPR ligand 
to a G-protein coupled receptor involves multiple interactions 
between functional groups on the GPR ligand and residues within 
the hydrophophilic binding site of the receptor. 

While a number of the amino acid residues in the 
dopamine D 2 receptor have been postulated to participate in D : 

ligand binding, based on results obtained from sj rc-nirprfpH 

mutagenesis studies and photoaf f inity labeling studies performed 
on the /3-adrenergic receptor, such studies have failed to 
specifically determine which residues are actually involved ir. 
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binding in the D : system. Sibley et ai . Soc. Neurosci. Abs , 
17:36.10, 324.5, 324.6 (1991). 

The clinical use of neuroleptics has provided a means 
for treating patients suffering from psychotic disorders. 
Short-term use of neuroleptics is indicated in several types of 
psychotic disorders, e.g., acute psychotic episodes, regardless 
of type; exacerbations of schizophrenia; acute manic excitement 
while deferring use of lithium or awaiting onset of its effects; 
adjunctive therapy for major depression with prominent psychotic 
symptoms, or when an antidepressant or ECT alone is not 
successful; for agitation in delirium, dementia, or severe 
mental retardation while seeking to identify and treat the 
primary basis of the problem; in certain chronic, degenerative, 
or idiopathic neuropsychiatric disorders with dyskinesias, such 
as Huntington's disease or Gilles de la Tourette ' s syndrome; or 
for ballism or hemiballism; childhood psychoses or apparently 
allied conditions marked by severe agitation or aggressive 
behavior; miscellaneous medical indications, notably nausea and 
vomiting, or intractable hiccups. 

Additionally, continuous long-term use of neuroleptics 
is indicated in many psychotic disorders, such c.s (for more than 
six months) (i) primary indications such as Schizophrenia, 
Paranoia 1 * 6 , Childhood psychoses, some degenerative or idiopathic 
neuropsychiatric disorders (notably, Huntington's disease and 
Gilles de la Tourette's syndrome); (ii) secondary indications 
such as extremely unstable manic-depressive or other episodic 
psychoses (unusual) , otherwise unmanageable behavior symptoms in 
dementia, amentia, or other brain syndromes; and (iii) 
questionable indications such as chronic characterological 
disorders with schizoid, "borderline, " or neurotic 
characteristics; substance abuse; or antisocial behavior, 
recurrent mood disorders. See, e.g., Baldessarini , Chemotherapy 
in Psychiatry, Revised and Enlarged Edition, Harvard University 
Press, Cambridge, MA, (1985), the contents of which is entirely 
incorporated herein by reference. 

Neuroleptics are also referred to as neuroplegics, 
psychoplegics , psycholeptics , antipsychotics and major 
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tranquilizers, but are sometimes distinguished from non- 
neuroleptic anti -psychotics . Neuroleptics have recently been 
characterized as an agent that produces sedative or 
tranquilizing effects, and which also produces motor side 
effects, such as catalepsy or extrapyramidal symptomatology. 
Nonlimiting representative examples of neuroleptics include 
phenothiazine derivatives (e.g., chlorpromazine) ; thioxanthine 
derivatives (e.g., thiothixene); butyrophenone derivatives 
(e.g. , haloperidol) ; dihydroindolone (e.g. , molindone) ; 
dibenzoxazepine derivatives (e.g., loxapine) ; and "atypical" 
neuroleptics (e.g., sulpiride, remoxipiride pimozide and 
clozapine) . See Berstein Clinical Pharmacology Littleton, 
Mass.iPSG Publishing (1978); Usdin et al Clinical Pharmacology 
in Psychiatry New York: Elsevier North-Holland (1981) ; and 
Baldessarini, supra, (1985); and , which references are herein 
entirely incorporated by reference. 

The term "atypical neuroleptics" has been used to 
describe antipsychotic neuroleptics that produce few or no 
extrapyramidal side effects and which do not cause catalepsy in 
animals (See, e.g., Picket et al, Arch. Gen. Psychiatry 49:345 
(May 1992) . Alternatively, atypical neuroleptics, such as 
clozapine, have been described as those neuroleptics which have 
a higher affinity for D 4 and D, sites than for D : sites (See, 
e.g., Davis et al Amer. J. Psych. 148:1474, 1476 (November 
1991) . 

The long term use of all known anti -psychotics , such 
as neuroleptics or non-neuroleptic antipsychotics, has resulted 
in serious side effects, as present in Table I, such as 
persistent and poorly reversible motoric dysfunctions (e.g., 
tardive dyskinesia) in a significant number of patients. These 
side effects are especially prevalent in geriatric populations, 
and adequate pharmacological treatment of these debilitating 
motoric dysfunctions is not currently available. This problem 
has sev erely limited the long-term, clinical administration of 
these agents . 
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TABLE I 

Neurological Side Effects of 
Neuroleptic -Antipsychotic Drugs 



Reaction Features Period of Proposed mechanism Treatment 

maximum risk 



Acute dystonia 



Spasm of muscles of 
tongue, face. neck, 
back: may mimic 
seizures: not 
hystencat 



1 -5 oays 



Dopamine excess? 
Acetylcholine excess? 



Antiparkinsomsm agems are 
diagnostic and curative 
(i.m. or ;.v., then o.o.) 



Parkinsonism Bradykinesia, rigidity, 5-30 days 
variable tremor, mask- (rarely 
factes, shuffling gait persists) 



Dopamine blockade 



Antiparkinsomsm agents 
Ip.o); dopamine agonists 
risky? 



Akathisia Motor restlessness: 5-60 days 

patient may experience (commonly 
anxiety or agitation persists) 



Unknown 



Reduce dose or change drug 
tow doses of propranolol:' 
antiparkinsomsm agents or 
or benzodiazepines may help 



Tardive Oral-facial dyskinesia: 6*24 months 

dyskinesia chorco-athetosis, some- (worse on 

times irreversible. withdrawal! 

rarely progressive 



Dopamine excess? 



Prevention best; treatment 
unsatisfactory; slow spontaneous 
remission 



"Rabbit" 
syndrome 



Malignant 
synarome 



Perioral tremor (late 
parkinsonism variant?!; 
usually reversible 

Catatonia, stupor, 
fever, unstable pulse 
and blood pressure: 
myoglobinemia; can 
be fatal 



Months or 
years 

Weeks 



Unknown 



Unknown 



Antiparkinsomsm agents; reduce 
dose of neuroleptic 



Stop neuroleptic; antiparkinsomsm 
agents usually fail; bromocriptine 
often helps: dantrolene variable; 
general supportive care crucial 



a. There may be an increased risk of hypotension on interacting high doses of propranolol with some antipsychotic 
agents; clonidine may also be effective at doses of 0.2*0.8 mg/day, but carries a high risk of hypotension (Zubenko et ai., 
Psychiatry Res. 11:143, 1984). 

In addition, clozapine, although apparently capable o 
producing less motor side effects, can cause irreversible, 
potentially fatal agranulocytosis in a minority of patients 
administered the drug. Such serious side effects limit the use 
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clozapine to patients who are resistant to treatment with other 
neuroleptics . 

Antipsychotics have a variety of significant 
pharmacological effects, e.g., as presented in the following 
Tables II and III. 

Table II 

Comparative Pharmacology of Neuroleptics 





Phenothiazine 


Thioxanthene 


Butyrophenone 




Derivative 


Deri vat i ve 


Deri vat i ve 


Alkaloid 








Pharmacologic Actions 


Chlorpromazine 


Thiothixene 


Haloperidot 


Antipsychotic 


Yes ♦ ♦ 


Yes ♦ ♦ 


Yes + «■ ♦ ♦ 


Antiemetic 


Yes + ♦ ♦ 


Not tested 


Yes ♦ ♦ + 


Hypothermia 


Yes * 


Yes + 


No 


Hypotension 


Yes ♦ ♦ 


Yes ♦ * ♦ 


+ 


Parkinsonism 


Yes + ♦ 


Yes ♦ 


Yes ♦ + + * 


Ant i adrenergic 


Yes + + 


Yes ♦ * * 


+ 


Antichol inergi c 


Yes ♦ 


Yes * 


Negl igible 


Antihistaminic 


Yes ♦ 


Negi igibte 


Negl i gible 


Releases NE, OA 


No 


NO 


No 


Blocks DA 


Yes + * 


Yes + 


Yes ♦ * + + 


Blocks NE 


Yes * + 


Yes ♦ + + 


Yes + 


Central sympathetic 


Yes ♦ + 


Yes + 


Yes ♦ + + 



suppressant 

ChlorpromaziRa. thiothixene , and haloperidot docreaoe the functional avoiiebilrty of dopamine (DA> ana norepinephrine INE) by blocking the 
dopamine receptor ohm in the baoal ganglia and norepinsphrine receptor sitos tn thalamic end hypothalamic areas . Reoerpine simply reduces the 
concent ratio no ot norepinephrine and dopamine in theoe oreeo. Both ol theoe actio ro reoutt in suppression of cantrat sympathetic activity 
+ + + indicates from very weak to very otrong ottects. 



Table III 

Comparative Pharmacology of Antipsychotics 



Extrapyramidal 
Drug 



Sedation 



Adrenergic 
Blockage 



Reaction 



Chlorpromazine 

Chlorprothixene 

Haloperidol 

Molindone 

Loxapine 



High 
High 
Low 

Moderate 
High 



Moderate to high 

High 

Low 

Moderate 

Low to moderate 



Moderate 

Low to moderate 

High 

Moderate to high 
High 



See Ebadi, PHARMACOLOGY , Little, Brown and Co., Boston, 61-65 

(1985); Cattabeni et al Adv. Biochem. Psychopharmacology 24:275 

(1980). Baldessarini , supra, which references are herein 

incorporated entirely by reference. 
5 However, despite the fact that thousands of neuroleptic - 

or ant ipsychotic - type compounds have been synthesized and reported 
in the literature, sucn compounds which lack serious side effects 

and which have sufficient pharmacological activity, have not been 

disclosed . 
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Alternative to dopamine receptor GPRs, as presented 
above, other neuroreceptor GPRs are involved in neurological 
pathologies, and drugs such as neuroreceptor GPR binding agents, 
presently used for treating these pathologies, also suffer from 
5 similar side effects as those of neuroleptics, as presented above. 
Other GPRs are also involved in receptor- related 
pathologies, such as hormone related GPRs involved in endocrine 
related pathologies. 

Accordingly, there is a need to provide G-protein coupled 

10 receptor binding agents, including neuroreceptor and endocrine 
receptor GPRs, which do not produce such deleterious and 
debilitating side effects as those produced by known agents, such 
as neuroleptics, which can be used for therapy or diagnosis of GPR 
related pathologies. 

15 . Citation of documents herein is not intended as an 

admission that any of the documents cited herein is pertinent 
prior art, or an admission that the cited documents are considered 
material to the patentabilty of the claims of the present 
application. All statements as to the date or representations as 

2 0 to the contents of these documents are based on the information 
available to the applicant and does not constitute any admission 
as to the correctness of the dates or contents of these documents . 



non- naturally occurring synthetic, isolated and/o^ recombinant GPR 
polypeptides which are fragments, consensus fragments and/or 
sequences having conservative amino acid substitutions, of at 

3 0 least one transmembrane domain of at least one G-protein coupled 
receptor, which polypeptides have been discovered to have 
receptor- like functional binding sites of neuroreceptor and 
endocrine GPRs, such that GPR polypeptides of the present 
invention may bind GPR ligands, or which may also modulate, 

35 quantitatively or qualitatively, GPR ligand binding to GPRs. 



SUMMARY OF THE INVENTION 



25 



It is therefore an object of the present invention to 
overcome one or more deficiencies found in the related art. 

It is another object of the present invention to provide 
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It is still another object of the present invention to 
provide GPR polypeptides and compositions that have only partially 
helical structures, in contrast to known characterized 
transmembrane domains of GPRs, such as, but not limited to, GPR 
5 transmembrane domains I-VTI. 

It is yet another object of the present invention to 
provide synthetic or recombinant GPR polypeptides, conservative 
substitution derivatives thereof, antibodies, anti- idiotype 
antibodies, compositions and methods that can be used as potential 

10 modulators of G-protein coupled receptor function, by binding to 
GPR ligands or modulate GPR ligand binding, due to their expected 
biological properties, which may be used in diagnostic, 
therapeutic and/or research applications. 

It is a further object of the present invention is to 

15 provide synthetic, isolated or recombinant polypeptides which are 
designed to inhibit or mimic various GPRs or fragments thereof, as 
receptor types and subtypes . 

According to one aspect of the present invention, a 
synthetic or recombinant GPR polypeptide is provided that 

20 comprises a GPR amino acid sequence of, e.g., at least 5, 10, 15 
or 20 amino acids, substantially corresponding to at least one 
transmembrane domain, or fragment and/or consensus peptide 
thereof, of a G-protein coupled receptor, wherein at least 20 
amino acids are preferred. In a preferred embodiment, the 

25 polypeptide is (a) chemically synthesized and/or (b) obtained from 
a recombinant host cell or organism which expresses a recombinant 
nucleic acid encoding a GPR polypeptide, as defined herein. 

In another preferred embodiment, the transmembrane domain 
is selected from at least one of TM1, TM2 , TM3 , TM4 , TM5 , TM6 or 

3 0 TM7 , corresponding to transmembrane domains I, II, III, IV, V, VI 
and VII, respectively, of a GPR. In another preferred embodiment, 
the transmembrane domain is a dopamine receptor transmembrane 
domain selected from the group consisting of at least one of a D,, 
D : . D 3 , D 4 and D 5 dopamine receptor transmembrane domain. The 

35 transmembrane domain, e.g., may be selected from at least one of D. 
receptor transmembrane domains III or V. In still another 
preferred embodiment, the GPR polypeptide amino acid sequence 
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substantially corresponding to an amino acid sequence contained in 
at least one of Fig. 2 (SEQ ID N0:2), Fig. 3 (SEQ ID NO:3) or Fig. 
5 (SEQ ID NO: 5) . / / ^ * - 

In another aspect of the present invention, a GPR 
composition is provided, comprising a GPR polypeptide, or a 
pharmaceutically acceptable ester, ether, sulfate, carbonate, 
malate, glucuronide or salt thereof, the composition further 
comprising a pharmaceutically acceptable carrier and/or diluent. 

In still another aspect of the present invention, a 
method is provided for treating a subject suffering from a disease 
state involving a qualitative or quantitative pathological 
abnormality of a GPR protein or a biological molecule functionally 
associated therewith. Such biological molecule may be a membrane 
cytoplasmic protein, lipid, carbohydrate, saccharide, nucleoside 
or nucleotide mono-, di-, or tri- phosphate, an enzyme, a co- 
factor, a nucleic acid, a neurotransmitter, an ion, a carrier, a 
cell receptor, or any combination thereof. 

In a preferred embodiment, the GPR protein is a dopamine 
receptor and the abnormality involves a dopamine related 
pathology, wherein the method comprises administering an effective 
dopamine receptor modulating amount of a GPR polypeptide of the 
present invention. In another preferred embodiment, the 
transmembrane domain is a D : dopamine receptor domain and the 
disease state is a psychiatric disorder, such as schizophrenia or 
schiz affective disorder (see American Psychiatric Association, 
Revised Manual of Diagnostic and Statistical Criteria for 
Psychiatric Disorders (DSM-III-R) . American Psychiatric Assoc. 
Press, Washington, DC (1989)). 

In another preferred embodiment, the GPR composition is 
administered as a pharmaceutical composition to provide a GPR 
polypeptide in an amount ranging from about 0.01 ^g to 100 mg/kg, 
and also preferably, about 10 fig to 10 mg/kg. In another 
preferred embodiment, the administering is by oral, intravenous, 
intramuscular, parenteral or topical administration, including 
mucosal administration to the nasal mucosa or the oral mucosa, by 
aerosol, nebulizer or drop administration as non- limiting 
examples . 



0 
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Other objects of the invention will be apparent to 



skilled practitioners from the following detailed description and 
examples relating to the present invention. 



Figure 1 is the amino acid sequence of a control peptide 
(SEQ ID N0:1), which is hydrophobic in its properties; but does 
not correspond to a known GPR transmembrane domain. 

Fig. 2 represents the amino acid sequence of a GPR 
10 transmembrane polypeptide, polypeptide II (SEQ ID N0:2), which 

corresponds to a portion of the dopamine D : receptor transmembrane 
segment III. 

Fig. 3 represents the amino acid sequence of a 
transmembrane polypeptide, polypeptide III (SEQ ID NO:3), 
15 corresponding to a consensus peptide of the dopamine D 2 receptor 
transmembrane domains I -VII. 

Fig. 4 represents the amino acid sequence of a consensus 
sequence of transmembrane domains that is shortened to be less 
than the length required to span a lipid bilayer. 
20 Fig. 5 represents a consensus amino acid sequence of 

transmembrane domain as a consensus peptide between dopamine 
receptors D, and D 3 . 

Fig. 6 is a representation of a circular dichroism 
spectrum of a solution of the consensus polypeptide III (SEQ ID 
25 N0:3) of Fig. 3. 

Fig. 7 is a graphical representation of radioligand 
binding assay data comparing control polypeptide II (SEQ ID NO:l) 
of Fig. l f labeled as "11" and consensus polypeptide I (SEQ ID 
NO: 3) of Fig. 3, labeled as "I". 
30 Fig. 8A-G are a comparison listing of amino acid 

sequences of transmembrane domains and adjacent amino acid 
sequences of representative GPRs (SEQ ID N0S:6-79). 



BRIEF DESCRIPTION OF THE FIGURES 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invencion relates to G-protein coupled 
receptor (GPR) polypeptides which can be used to mimic naturally 
occurring or isolated GPRs, or to modulate the binding of GPR 
5 ligands to GPRs , such as inhibition or enhancement of binding. 
GPR polypeptides of the present invention can include GPR 
transmembrane domain fragments and/or consensus peptides thereof, 
of at least 4-10 amino acids in length, and/or corresponding 
sequences having conservative amino acid substitutions as 

10 "substitution peptides", wherein the GPR polypeptide binds a GPR 
ligand or modulates the binding of a GPR ligand to a GPR in vitro , 
in vivo or in situ. 

GPR polypeptides of the present invention can be 
synthesized or recombinantly produced, or optionally purified, to 

15 provide commercially useful amounts of GPR polypeptides for use in 
therapeutic, diagnostic or research applications, according to 
known method steps, see, e.g., Ausubel et al, eds. Current 
Protocols in Mo lecular Biology . Wiley Interscience , N.Y., (1987, 
1992); and Sambrook et al, Molecular Cloning, A Laboratory Manual r 

20 2nd edition, Vols. 1-3, Cold Spring Harbor Press, (1989), which 
references are herein entirely incorporated by reference. 

Additionally, GPR polypeptides according to the present 
invention can be used to generate polyclonal and/or monoclonal 
antibodies, anti- idiotype antibodies thereto, or fragments 

25 thereof, which may used for diagnostic and/or therapeutic 

applications, according to known method steps, see, e.g., Harlow 
and Lane, Antibodies: A La boratory Manual . Cold Spring Harbor 
Press (1988), which is herein entirely incorporated by reference. 

GPR polypeptides, anti -GPR antibodies or anti - idiotype 

30 antibodies (or fragments thereof) to GPR polypeptides have been 
unexpectedly discovered to quantitatively or qualitatively 
modulate G-protein coupled receptors, such that binding of GPR 
polypeptides or anti- idiotype antibodies (or fragments thereof) to 
G-protein coupled receptor ligands may be used for diagnostic 

35 research or therapeutic applications of the present invention. 
Such GPR polypeptides, antibodies or anti- idiotype antibodies of 
the present invention may therefore be used as modulators of 
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G-protein coupled receptors, such as neuroreceptors or endocrine 
receptors, as non- limiting examples. 

Binding of such GPR polypeptides, (including GPR 
fragments, consensus peptides, substitution derivatives and anti- 
5 idiotype antibody fragments) of the present invention may be used 
to treat symptoms of, and provide diagnosis and treatment for, 
pathologies related to GPRs . Such pathologies have been found to 
correlate with symptoms occurring in neurological, viral or 
endocrine pathologies. D 2 receptor- related psychotic disorders, 
10 including schizophrenia, now treated with neuroleptics, is a non- 
limiting example thereof. 

The use of synthetic or recombinant GPR polypeptides of 
the present invention can be preferable to the use of known drugs 
that bind G- protein coupled receptors, such as neuroleptics that 
bind or inhibit the biological effect of binding to neuroreceptors 
as a non- limiting example. Such polypeptides are expected to have 
significantly less side effects than presently used drugs 
presently used for inhibiting such receptor binding including 
neuroleptics, as they would structurally mimic naturally occuring 
GPRs and/or modulate ligand binding. Thus, GPR polypeptides are 
expected to have reduced side effects attributable to known 
foreign compound drugs, with less immunogenicity , and reduced 
potential for motoric side effects (e.g., extrapyramidal symptoms 
and/or tardive dyskinesia) . 

The present invention is also related to the production, 
by chemical synthesis or recombinant DNA technology, of GPR 
polypeptides, preferably as small as possible while still 
retaining sufficiently high affinity or interaction with G-protein 
coupled receptors to modulate, such as to inhibit or to enhance, 
binding to such receptors by GPR ligands. 

GPR polypeptides of the present invention may include 5- 
10 to 50-150 amino acid fragments, consensus sequences or 
substitution sequences of GPRs, e.g., as presented in Fig. 8A-G 
(SEQ ID NOS:6-79) including, but nnr limits fn, ^n-jrl . .luy jminc 
receptors, cAMP receptors, adenosine receptors, 0- adrenergic 
receptors, muscarinic acetylcholine receptors, a- adrenergic 
receptors, serotonin receptors (5-HT), histamine H2 receptors, 
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thrombin receptors, kinin receptors, follicle stimulating hormone 
receptors, opsins and rhodopsins, odorant receptors, 
cytomegalovirus GPRs, adenosine A2 receptors, dopamine receptor, 
histamine H2 receptors, octopanmine receptors, N-formyl receptors, 
anaphylatoxin receptors, thromboxane receptors, IL-8 receptors, 
platelet activating factor receptors, endothelin receptors, 
bombesin gastrin releasing peptide receptor, neuromedin B 
preferring bombesin receptors, vasoactive intestinal peptides, 
neurotensin receptors, bradykinin receptors, thyrotropin- releasing 
hormone receptors, substance P receptors, neuromedin K receptors, 
adrenal angiotensen II type I receptors, nias oncogene 
(angiotensin) receptors lutropin- choriogonadotropin receptors, 
thyrotropin receptors, follicle stimulating hormone receptors, 
cannabinoid receptors, glucocorticoid- induced receptors, 
endothelial cell GPRs, testis GPRs, and thoracic aorta GPRs, and 
homologs thereof having a homology of at least 80*; with at least 
one of transmembrane domains 1-7, as described herein. See, e.g., 
Probst et al DNA and Cell Biology 11:1-20(1992), which is entirely 
incorporated herein by reference. 

Accordingly, a "G- protein coupled receptor polypeptide" 
or " GPR polypeptide" of the present invention includes 
polypeptides having a "GPR amino acid sequence" which 
substantially corresponds to at least one 10 to 50 amino acid 
fragment and/or consensus sequence of a known GPR or group of 
GPRs, wherein the GPR polypeptide has homology of at least 80%, 
such as 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94,' 
95, 96, 97, 98, 99 or 100% homology, while maintaining GPR 
modulating activity, wherein a GPR polypeptide of the present 
invention is not naturally occurring or is naturally occurring but 
is in a purified or isolated form which do^s nnr occur in nature. 
Preferably, a GPR polypeptide of the present invention 
substantially corresponds to a transmembrane domain of a GPR or 
group of GPRs as a consensus sequence. 

Also preferred are GPR polypeptides wherein the GPR amino 
acid sequence is 4-10 to 50 amino acids in length, such as 5, 6, 
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, -0, 21, 22, 23,' 
24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36. 37. 3fi \o 
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40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 
110, 120, 130, 140 or 150 amino acids, or any range therein. 

An amino acid or nucleic acid sequence of a GPR 
polypeptide of the present invention is said to "substantially 
5 correspond" to another amino acid or nucleic acid sequence, 

respectively, if the sequence of amino acids or nucleic acid in 
both molecules provides polypeptides having biological activity 
that is substantially similar, qualitatively or quantitatively, to 
the corresponding fragment of at least one GPR transmembrane 

10 domain, or which may be synergistic when two or more transmembrane 
domains, consensus sequences or homologs thereof are present. 

Additionally or alternatively, such "substantially 
corresponding" sequences of GPR polypeptides include conservative 
amino acid or nucleotide substitutions, or degenerate nucleotide 

15 codon substitutions wherein individual amino acid or nucleotide 
substitutions are well known in the art. 

Alternatively or additionally, substantially 
corresponding refers to GPR polypeptides having amino acid 
sequences having at least 80% homology or identity to an amino 

20 acid sequence of SEQ ID NO:l, such as 80, 81, 82, 83, 84, 85, 86, 
87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99 or 100% 
homology or identity. 

Accordingly, GPR polypeptides of the present invention, 
or nucleic acid encoding therefor, include a finite set of 

25 substantially corresponding sequences as substitution peptides or 
polynucleotides which can be routinely obtained by one of ordinary 
skill in the art, without undue experimentation, based on the 
teachings and guidance presented herein. For a detailed 
description of protein chemistry and structure, see Schulz, G.E. 

30 et al . , Principles of Protein Structure, Springer- Verlag , New 
York, 1978, and Creighton, T.E., Proteins: Structure and 
Molecular Properties, W.H. Freeman & Co., San Francisco, 1983, 
which are hereby incorporated by reference. For a presentation of 
nucleotide sequence substitutions, such as rndnn profor an ccc> 3CC — 

35 Ausubel et al , supra t at §§ A. 1 . l-A. 1 . 24 , and Sambrook et al , 
supra , at Appendices C and D. 
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Conservative substitutions of a GPR polypeptide of the 
present invention includes a variant wherein at least one amino 
acid residue in the polypeptide has been conservatively replaced 
by a different amino acid. Such substitutions preferably are made 
5 in accordance with the following list as presented in Table IV, 
which substitutions may be determined by routine experimentation 
to provide modified structural and functional properties of a 
synthesized polypeptide molecule, while maintaining the receptor 
binding, inhibiting or mimicking biological activity, as 
10 determined by known GPR receptor activity assays. 



Table IV 



Original 


Exesaolarv 


Residue 


Substitution 


Ala 


Gly;Ser 


Arg 


Lys 


Asn 


Gin; His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Ala; Pro 


His 


Asn; Gin 


He 


Leu; Val 


Leu 


He; Val 


Lys 


Arg; Gin; Glu 


Met 


Leu; Tyr; He 


Phe 


Met ; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp ; Phe 


Val 


He; Leu 



Alternatively, another group of substitutions of GPR 
polypeptides of the present invention are those in which at least one 
amino acid residue in the protein molecule has been removed and a 
different residue inserted in its place according to the following 
Table V. The types of substitutions which may be made in the 
protein or peptide molecule of the present invention may be based on 
analysis of the frequencies of amino acid changes between a 
homologous protein of different species, such as those presented in 
Table 1-2 of Schulz et al . , supra and Figs. 3-9 of Creighton, supra. 
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Based on such an analysis, alternative conservative substitutions are 
defined herein as exchanges within one of the following five grouos : 

TABLE V 

1. Small aliphatic, nonpolar or slightly polar residues: Ala Ser 
Thr (Pro, Gly) ; 

2. Polar, negatively charged residues and their amides: Asp, Asn 
Glu, Gin; 

3. Polar, positively charged residues: 
His, Arg, Lys; 

4. Large aliphatic, nonpolar residues: 
Met, Leu, He, Val (Cys) ; and 

5. Large aromatic residues: Phe, Tyr, Trp . 

The three amino acid residues in parentheses above have 
special roles in protein architecture. Gly is the only residue 
lacking any side chain and thus imparts flexibility to the chain. 
This however tends to promote the formation of secondary structure 
other than a-helical. Pro, because of its unusual geometry, tightly 
constrains the chain. It generally tends to promote /?- turn- like 
structures, although in some cases Cys can be capable of 
participating in disulfide bond formation which is important in 
protein folding. Note the Schulz et al . would merge Groups 1 and 2, 
above. Note also that Tyr, because of its hydrogen bonding 
potential, has significant kinship with Ser, and Thr, etc. 

Conservative amino acid substitutions according to the 
present invention, e.g., as presented above, are known in the art and 
would be expected to maintain biological and structural properties 
15 of the polypeptide after amino acid substitution. Most deletions and 
insertions, and substitutions according to the present invention are 
those which do not produce radical changes in the characteristics of 
the protein or peptide molecule. "Characteristics" is defined in a 
non- inclusive manner to define both changes in secondary structure, 
e.g. a- helix or 0- sheet, as well as changes in physiological 
activity, e.g. in receptor binding assays. 

However, when the exact effect of the substitution, 
deletion, or insertion is to be confirmed one skilled in the art will 
appreciate that the effect of the substitution or substitutions will 
be evaluated by routine screening assays, either immunnassayc or 
bioassays to confirm biological activity, such as receptor binding 
or modulation of ligand binding to the corresponding GPR. See, e.g., 
Maranges et al . , eds . , for example, a substituted polypeptide 
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typically is made by sice- specif ic mutagenesis of the peptide 
molecule -encoding nucleic acid, expression of the mutant nucleic acid 
in recombinant cell culture, and, optionally, purification from the 
cell culture, for example, by immunoaf f inity chromatography using a 
specific antibody on a chemically derivatized column or immobilized 
membranes or hollow fibers (to absorb the mutant by binding to at 
least one epitope) . 

A preferred use of this invention is the production, by 
chemical or recombinant DNA technology, of GPR polypeptides, 
preferably as small as possible while still retaining sufficiently 
high affinity for binding to, or association with, GPRs. By 
production of GPR polypeptides including smaller fragments or 
variants of such transmembrane domains, one skilled in the art, using 
known binding and inhibition assays, can readily identify the GPR 
polypeptides capable of binding minimizing or modulating G-protein 
coupled receptors using known methods. Non- limiting examples of 

fragments of GPRs to be used as GPR polypeptides or as a basis for 
consensus sequences thereof for GPR polypeptides, are presented in 
Figs. 2-5 and Fig. 8A-G, wherein fragments or consensus sequences of 
10 to 5 0 amino acids of at least one sequence of Figs. 2-5 or 
corresponding to at least one transmembrane domain or domains 1-7 
listed in Fig. 8A-G (SEQ ID NOS:6-79) are encompassed by the present 
invention, such as at least one transmembrane domain of one or more 
GPRs, such as a cAMP receptor (l), adenosine receptors (2-3); 
muscarinic acetylcholine receptors (4-8); human adrenergic receptors 
(9-11, 14-16, 19-25, 28); adrenergic receptors (9-28); human thrombin 
receptor (31); endothelin receptors (35-36), bombesin receptors (37- 
38), endocrine receptors (48-50), rhodopsin (51). opsins (52-54), 
odorant receptors (55-64), and cytomegalovirus GPRs (72-54), as non- 
limiting examples, wherein (-#-) refers to the listed sequences in 
Fig. 8A-G. 

Accordingly, GPR polypeptides may include consensus 
sequences and/or fragments of at least one of transmembrane domain 
1-7 of one or more GPRs as presented in Figs. 2-5 (SEQ ID NO: 2 -5) or 
Fig. 8A-G. (SEQ ID NOS:6-79) or homologs thereof, which GPR 
polypeptides do not occur naturally, and/or which are provided in an 
isolated and/or purified form not found in nature. 



o o 

WO 94/05695 PCT/L'S93/08528 



19 



15 



Consensus peptides of GPR polypeptides of the present 
invention may include peptides which are distinct from known GPR 
sequences in critical structural features, but which are derived from 
consensus sequences of homologous GPR transmembrane domains 1-7, 
5 e.g., as presented in Fig. 8A-G (SEQ ID NOS.-6-79). Such consensus 
peptides may be derived by molecular modeling, optionally combined 
with hydrophobicity analysis and/or fitting to model helices, as 
non-limiting examples. Such modeling can be accomplished according 
to known method steps using known modeling algorithms, such as, but 
10 not limited to, ECEPP, INSIGHT, DISCOVER, CHEM-DRAW, AMBER, FRODO and 
CHEM-X. Such algorithms compare transmembrane domains between 
related G-protein coupled receptors, determine probable energy - 
miminized structures and define alternative consensus polypeptide 
fragments . 

Such consensus peptides or fragments of GPRs may then be 
synthesized or produced recombinantly , in order to provide GPR 
polypeptides according to the present invention which mimic, modulate 
or inhibit binding of ligands to G-protein coupled receptors. GPR 
ligands, in the context of the present invention, refer to biological 
molecules that bind GPRs in vitro, in situ or in vivo, and may 
include hormones, neurotransmitters, viruses or receptor binding 
domains, thereof, opsins, rhodopsins, nucleosides, nucleotides, 
coagulation cascade factors, odorants or pheremones, toxins, colony 
stimulating factors, platelet activating factors, neuroactive 
peptides, neurohumors, or any biologically active compounds, such as 
drugs or synthetic or naturally occurring compounds. 

The following non- limiting examples of consensus peptides 
of GPRs of the present invention are provided by way of guidance and 
not by way of limitation. In GPR polypeptides of the present 
invention, one or more, preferably 4-10, Asp and/or Lys residues may 
additionally be incorporated at the carboxy and/or amino terminal 
ends in order to provide expected helix forming effects of the helix 
dipole effect, e.g., as described in Baldwin et al Biochem. 28:2130 
(1989); Baldwin et al Proc . Mat 'I Acad. Sci . USA 84:8898 MQB-n. R nd 
13 Baldwin et al Proc. Nafl Acad. Sci. USA 86:5286 (1989), which 
references are entirely incorporated herein by reference. 
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As a non- limiting example of GPR polypeptide of the present 
invention, dopamine receptor transmembrane fragments of D. 
transmembrane domain (e.g., domain III) as presented in Fig. 2 (SEQ 
ID NO: 2) or a consensus sequence as presented in Fig. 3 (SEQ ID 
5 NO:3), e.g., of D : domains I-VII. Additionally or alternatively a 
consensus sequence may include less than 20 amino acids, such as 15 
amino acids corresponding to a transmembrane domain, such as a D^ 
receptor domain, as presented in Fig. 4 (SEQ ID NO : 4 ) as polypeptide 
IV, which is smaller than the length required by spanning an average 

10 lipid bilayer of a cell membrane. 

However, in the context of the present invention, GPR 
polypeptides of greater than 15 -20 amino acids are preferred such 
that the GPR polypeptides are able to span the lipid bilayer. 

Another non- limiting example of a GPR polypeptide using 

15 dopamine receptor transmembrane domains is a consensus sequence of 
two or more GPR receptors, such as the dopamine D, and D 2 receptors. 
A non- limiting example of such a consensus GPR polypeptide is 
presented in Fig. 5 (SEQ ID NO: 5) . 

Additionally, modified amino acids or chemical derivatives 

20 of amino acids of consensus or fragments of GPRs proteins, according 
to the present invention may be provided, which polypeptides contain 
additional chemical moieties or modified amino acids not normally a 
part of- the protein. Covalent modifications of the peptide are thus 
included within the scope of the present invention. Such 

25 modifications may be introduced into a GPR polypeptide by reacting 
targeted amino acid residues of the polypeptide with an organic 
derivatizing agent that is capable of reacting with selected side 
chains or terminal residues. The following examples of chemical 
derivatives are provided by way of illustration and not by way of 

30 limitation. 

Aromatic amino acids may be replaced with D- or 
L-naphylalanine, D- or L-Phenylglycine, D- or L- 2 - thieneylalanine, 
D- or L-1-, 2-, 3- or 4 -pyreneylalanine , D- or L- 3 -thieneylalanine, 
D- or L- (2 -pyridinyl.) -alanine, D- or L- (3 -pyridine 1) -alanine, D- or 
3? L- (2-pyrazinyl) -alanine, D- or L- (4 - isopropyl) -phenylglycine, 
D- (trifluoromethyl) -phenylglycine, D- (trif luoromethyl) -phenylalanine, 
D- p- f luorophenylalanine , D- or L-p-biphenylphenylalanine , D- or 
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L-p-methoxybiphenylphenylalanine, D- or L-2 - indole {alkyl) alanines , 
and D- or L-alkyiainines where alkyl may be substituted or 
unsubstituted methyl, ethyl, propyl, hexyl , butyl, pentyl , iso- 
propyl, iso-butyl, sec-isotyl, iso-pentyl, non-acidic amino acids, 
5 cf C1-C20. 

Acidic amino acids can be substituted with non- carboxyiate 
amino acids while maintaining a negative charge, and derivatives or 
analogs thereof, such as the non-limiting examples of (phosphono) - 
alanine, glycine, leucine, isoleucine, threonine, or serine; or 

10 sulfated (e.g., -S0 3 H) threonine, serine, tyrosine. 

Other substitutions may include unnatural hyroxylated amino 
acids may made by combining "alkyl" (as defined and exemplified 
herein) with any natural amino acid. Basic amino acids may be 
substituted with alkyl groups at any position of the naturally 

15 occurring amino acids lysine, arginine, ornithine, citrulline, or 
(guanidino) -acetic acid, or other (guanidino) alkyl -acetic acids, 
where "alkyl" is define as above. Nitrile derivatives (e.g., 
containing the CN-moiety in place of COOH) may also be substituted 
for asparagine or glutamine, and methionine sulfoxide may be 

20 substituted for methionine. Methods of preparation of such peptide 
derivatives are well known to one skilled in the art. 

In addition, any amide linkage in any of the GPR 
polypeptides can be replaced by a ketomethylene moiety, e.g. (-C(=0) - 
CH 2 -) for ( - (C«0) -NH- ) . Such derivatives are expected to have the 

25 property of increased stability to degradation by. enzymes, and 
therefore possess advantages for the formulation of compounds which 
may have increased in vivo half lives, as administered by oral, 
intravenous, intramuscular, intraperitoneal, topical, rectal, 
intraocular, or other routes. 

30 In addition, any amino acid representing a component of the 

said peptides can be replaced by the same amino acid but of the 
opposite chirality. Thus, any amino acid naturally occurring in the 
L- configuration (which may also be referred to as the R or S , 
depending upon the structure of th e rhpmiral pnriryi may hn r^plnr?^ 

35 with an amino acid of the same chemical structural type, but of the 
opposite chirality, generally referred to as the D- amino acid but 
which can additionally be referred to as the R- or the S-, depending 
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upon its composition and chemical configuration. Such derivatives 
have the property of greatly increased stability to degradation by 
enzymes, and therefore are advantageous in the formulation of 
compounds which may have longer in vivo half lives, when 
5 administered by oral, intravenous, intramuscular, intraperitoneal, 
topical, rectal, intraocular, or other routes. 

Additional amino acid modifications of amino acids of GPR 
polypeptides of to the present invention may include the following: 
Cysteinyl residues may be reacted with alpha -haloacetates (and 

10 corresponding amines), such as 2 -chloroacetic acid or 
chloroacetamide, to give carboxymethyl or carboxyamidomethyl 
derivatives. Cysteinyl residues may also be derivatized by reaction 
with compounds such as bromotrif luoroacetone, alpha-bromo- 
beta- (5 -imidozoyl) propionic acid, chloroacetyl phosphate, 

15 N-alkylmaleimides, 3 -nitro-2-pyridyl disulfide, methyl 2-pyridyl 
disulfide , p- chloromercuribenzoate , 2 - chloromercuri - 4 -ni trophenol , 
or chloro-7-nitrobenzo-2-oxa-l, 3-diazole. 

Histidyl residues may be derivatized by reaction with 
compounds such as diethylprocarbonate e.g., at pH 5.5-7.0 because 

20 this agent is relatively specific for the histidyl side chain, and 
para-bromophenacyl bromide may also be used; e.g. , where the reaction 
is preferably performed in 0.1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues may be reacted with 
compounds such as succinic or other carboxylic acid anhydrides. 

25 Derivatization with these agents is expected to have the effect of 
reversing the charge of the lysinyl residues. ' Other suitable 
reagents for derivatizing alpha -amino -containing residues include 
compounds such as imidoesters/e .g. , as methyl picolinimidate; 
pyridoxal phosphate ; pyridoxal ; chl o rob or ohydr ide ; 

30 trinitrobenzenesulf onic acid; O-methylisourea ? 2,4 pentanedione ; and 
transaminase- catalyzed reaction with glyoxylate. 

Arginyl residues may be modified by reaction with one or 
several conventional reagents, among them phenylglyoxal , 
2 , 3 -butanedione, l , 2 - cyclohexanedione , and ninhydrin according to 

3 5 known method steps. Derivatization of arginine residues requires 
that the reaction be performed in alkaline conditions because of the 
high pKa of the guanidine functional group.. Furthermore, these 
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reagents may react with the groups of lysine as well as the arginine 
epsilon- amino group. 

The specific modification of tyrosyl residues p er se is 
well-known, such as for introducing spectral labels into tyrosyl 
5 residues by reaction with aromatic diazonium compounds or 
tetranitromethane . N-acetylimidizol and tetranitromethane may be 
used to form 0-acetyl tyrosyl species and 3-nitro derivatives., 
respectively. 

Carboxyl side groups (aspartyl or glutamyl) may be selec- 

10 tively modified by reaction with carbodiimides (R' N-C-N-R*) such as 
1-cyclohexyl- 3 - (2 -morpholinyl - (4-ethyl) carbodiimide or 1- 
ethyl-3- ( 4 - azonia- 4 , 4 - dimethylpentyl ) carbodiimide. Furthermore, 
aspartyl and glutamyl residues may be converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions'. 

15 Glutaminyl and asparaginyl residues may be frequently 

deamidated to the corresponding glutamyl and aspartyl residues. 
Alternatively, these residues may be deamidated under mildly acidic 
conditions. Either form of these residues falls within the scope of 
the present invention. 

20 Derivatization with bifunctional agents is useful for 

cross -linking the peptide to a water- insoluble support matrix or to 
other macromolecular carriers, according to known method steps. 
Commonly used cross - linking agents include, e.g., 
1, l-bis (diazoacetyl) -2-phenylethane, glutaraldehyde, 

25 N-hydroxysuccinimide esters, for example, esters with 
4-azidosalicylic acid, homobif unctional imidoesters, including 
disuccinimidyl esters such as 3,3'- 
dithiobis (succinimidylpropionate) , and bifunctional maleimides such 
as bis-N-maleimido- 1 , 8-octane. Derivatizing agents such as 

3 0 methyl - 3 - [ (p - azidophenyl ) dithio] propioimidate yield phot oactiva table 
intermediates that are capable of forming crosslinks in the presence 
of light. Alternatively, reactive water- insoluble matrices such as 
cyanogen bromide-activated carbohydrates and the reactive substrates 
described in U.S . Patent — Uos 3,969,287; — 3 , 601, 01G ; — 4 , 195 , 128 ; 

35 4,247,642; 4,229,537; and 4,330,440 (which are herein incorporated 
entirely by reference) , may be employed for protein immobilization. 
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Other modifications of GPR polypeptides of the present 
invention may include hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threonyl residues, 
methylation of the alpha-amino groups of lysine, arginine, and 
5 histidine side chains (T.E. Creighton, Proteins: Structure and 
Molecule Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 
(1983)), acetylation of the N-terminal amine, methylation of main 
chain amide residues (or substitution with N-methyl amino acids) and, 
in some instances, amidation of the C- terminal carboxyl groups, 

10 according to known method steps. 

Such derivatized moieties may improve the solubility, 
absorption, permeability across the blood brain barrier biological 
half life, and the like. Such moieties or modifications of GPR 
polypeptides may alternatively eliminate or attenuate any possible 

15 undesirable side effect of the protein and the like. Moieties 
capable of mediating such effects are disclosed for example, in 
Remington' s Pharmaceutical Sciences, 16th ed. , Mack Publishing Co., 
Easton, PA (1980) . 

Such chemical derivatives of GPR polypeptides also may 

20 provide attachment to solid supports, including but not limited to, 
agarose, cellulose, hollow fibers, or other polymeric carbohydrates 
such as agarose, cellulose, such as for purification, generation of 
antibodies or cloning; or to provide altered physical properties, 
such as resistance to enzymatic degradation or increased binding 

25 affinity or modulation for GPRs, which is desired for therapeutic 
compositions comprising GPR polypeptides, antibodies thereto or 
fragments thereof. Such peptide derivatives are well-known in the 
art, as well as method steps for making such derivatives using 
carbodiimides active esters of N- hydroxy succinimmide , or mixed 

3 0 anhydrides, as non- limiting examples. 

Variation upon consensus peptide sequences of GPR 
polypeptide of the present invention may also include: the addition 
of one, two, three, four, or five lysine, arginine or other basic 
residues added to the -C00H terminal end of the peptide; and/or one, 

3 5 two, three, four, or five glutamate or aspartate or other acidic 
residues added to the amino terminal end of the peptide, where 
"acidic" and "basic" are as defined herein. Such modifications are 
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well known to increase the a- helical content of the peptide by the 
"helix dipole effect". They also can provide enhanced aqueous 
solubility of the peptide. See, e.g., Baldwin et al . , supra 

As another non- limiting example of a GPR polypeptide of the 
5 present invention, serotonergic receptors (5-HT) consensus sequences 
may be determined using presently known 5-HT sequences and include, 
e.g., as consensus peptides of TM3 , TM5 and TM7, • espectively : 

5-HT consensus (1) DDDDNI WS I FDWI GYLNS I SMVI YTLFKKKK (SEQ ID NO: 80) 
5-HT consensus (2) DDDDNI WNI FSTIGYLNS IS PVS VIMHI YGKKKK (SEQ ID NO: 81) 
10 5-HT consensus (3) DDDDGYS I YDTLVTFAINPVYITVFKKKK (SEQ ID NO: 82) 

Such non-naturally occurring consensus sequences may also 
be further modified according to known method steps to provide 
additional consensus peptides with substituted amino acids to 
increase or decrease a-helical propensity and/or solubility (e.g., 
15 hydrophilicity) . As a non-limiting example, 5-HT consensus peptide 
(1) above may be modified according to the present invention to have 
increase helical propensity and increased aqueous solubility as 
follows : 

5-HT consensus (4) DDDDNAWSAFDWALYLNSISMAIYTYAKKKK (SEQ ID NO:83), 

20 wherein, e.g., smaller, non-polar residues replace either larger, 
more polar residues (e.g., Ala for lie or Val) or larger aromatic 
residues (e.g., Ala for Phe) . 

Another non- limiting , illustrative example of consensus GPR 
polypeptides of the present invention are those for adrenergic 
25 receptors, are the following: 

An example of the consensus GPR polypeptide for 
domain VII across all presently known adrenergic receptors is as 
follows : 

adrenergic consensus U> LFSFITWLGYANSSLNPIIYTTF (SEQ ID NO: 84) 



3 0 An example of a consensus GPR polypeptide for domain V 

across all adrenergic receptors is as follows: 
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adrenergic consensus (2) VYTIYSSSWFFAPSLAIMVITYT (SEQ ID NO:85) 

Examples of a consensus GPR polypeptide for 
domain III across all adrenergic receptors are as follows: 

adrenergic consensus (3) IWLTSDIMSTSSILHNLCVISF (SEQ ID NO:8b) 



5 An example of a consensus GPR polypeptide for domains III, 

V, and VII of all adrenergic receptors is as follows: 

adrenergic consensus (4) IWSIFSSDIVVGYANHSSLAIMCPIVIYTV (SEQ ID NO:87) 
adrenergic consensus (5) IFTIFSSDIAVGYANHSSAAIMPIVIYSV (SEQ ID NO:88), 

Wherein variations and substitutions of amino acids may be made as 
10 described herein. 

Non- limiting examples of consensus GPR polypeptides for 
transmembrane domain III across several or many, such as 1-500, or 
any range or value therein, G-protein receptors are as follows: 

TM3-U) YAI FVLYASAWLS FLNCP FIVTLNI (SEQ ID NO: 96) 
15 TM3-(2) YAI FVLYATAWLS FLNCP FIVTLNI ( S EQ ID NO:97) 
TM3-(3) YAI FVLYATAWLTFLNCPFIVTLNI ( SEQ ID NO:98) 
TM3-<4) YAIFVLYASAWLTFLNCPFIVTLNI (SEQ ID NO:99) 
TM3- (5) WAI FVLYASAWLS FLNCPFIVTLNI (SEQ ID NO:100) 
TM3-(6) WAI FVLYATAWLSFLNCPFI VTLNI ( SEQ ID NO: 101) 
2 0 TM3-(7) WAI FVLYATAWLTFLNCP FIVTLNI ( SEQ ID NO: 102) 
TM3-(8) WAI FVLYASAWLTFLNCPFIVTLNI ( SEQ ID NO : 1 0 3 ) 
TM3 - ( 9 ) YAVFVLYAS AWLS FLNMP F I VTLNI (SEQ ID NO : 104 ) 
TM3-(10) YAVFVLYATAWLSFLNMPFIVTLNI (SEQ ID NO:105) 
YAVFVLYATAWLTFLNMP FIVTLNI (SEQ ID NO: 106) 
YAVFVLYAS AWLTFLNMP F I VTLNI ( SEQ ID NO : 1 0 7 ) 
YAI FVLYASAWLS FLNCVTAS I PFI VTLNI (SEQ ID NO:108) 
YAI FVLYASAWLS FLNCTSS I WTAS I VTLNI (SEQ ID NO: 109) 
YAIFVLYASAWLSFLNVTLNICTSSIV (SEQ ID NO: 110) 
YAI FVLYASAWLS FLNTAS I LNLMF I VTLNI (SEQ ID NO:lll) 
YAI FVLYASAWLS FLNMAS I LNLPFI VTLNI (SEQ ID NO: 112) 
YAI FVLYASAWLS FLNSGILLLAPFI VTLNI (SEQ ID NO: 113) 
YAI FVLYASAWLS FLNMSGILLLAPFIVTLNI (SEQ ID NO: 114) 
YAIFVLYASAWLfFLNSELSVYTLTVCPFIVTLNI (SEQ ID NO: 115) 
YAI FVLYASAWLS FLNMSELSVYTLTVP FIVTLNI (SEQ ID NO: 116) 



TM3 - (11 
25 TM3-(12 
TM3- (13 
TM3- (14 
TM3- (15 
TM3 - (16 
30 TM3-(17 
TM3- (18 
TM3- (19 
TM3- (20 
TM3- (21 
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TM3 - 


(22) 


YATFVT.YACAWT.ACFT CW*t*T TVCFT MPPFT1TTT WT / crn m M,n .ml 




TM3 - 


(23 ) 


YATPVT.Y21Q2iWT.il CFT CWTT TTTDFTirTT WT f CFO TD MH ■ 1 1 Q \ 




TM3 - 


(24 ) 


YATFVT/YACAUT.CPT.ACFT.CWACPT C CTT 7 U T'V7MMT3PTVTT MT (crn Tn MH • i i Q \ 




TM3 - 


(25 ) 


YATFVLYASAWT^gFT^fV^ETai.WgT.rPFTVTT.NT f CFO TD wn- 1 ?f)l 
■A 'va f vu i nn rt n tin r xmvjuci xiujnoipjurr x v x xjXi x i o xxv w . x x* u / 


5 


TM3 - 




YATFVT.VAC AWT CCT WfV2FTAT UCT TVTT WT / CPO TH MO • 1 0 1 1 
l/iir V JjXiuiiniijf J-iTlVjO&XAXjnoXjX V i XxH X V o£*L^ XU iNL/ . x x> X J 




TM3 - 


f 27 ) 


YATFVT/YAC AWTi"2riPT AT UCT wr'DlTTirrT WT / CTTH TTi WO • 1 0 0 \ 




TM3 - 


f 2 ft ) 


Y2iTP*\7T.VZi(^/TlirTIiT UCT CCT wnOITTTrPT WT / CCfl TF» WO . 1 "5 \ 




TT13 - 


( 29 ) 


YATTT\7T,Y2igIiWT CTTTTTTT T imVT J^WTTT T WOOt? TT7TT WT / CCO TT» MO .1 Oil 




TM3 - 




Y2iTTT\7T.V2i CHUT CUTTT T tmVT/^WCT T DCTVTT WT / CPO TPi MO . T5C1 


10 


TM3 - 


( 31 ) 


YATFVT.Y2iCIiUT.CFT.W*r2ir*PVV2iTTIlCT.PPTTTrTBT TDPTVTT WT ^ Cm TTi • n C 1 




TM3 - 


(32 ) 


YAIFVLYA*?AUT,TACPYVATTACT J r'FTTPT2^ Tn wn- 1 




TM3 - 


(33) 


YAIFVLYATACFYVAITACLrFTTEIALTSFT^ { QFO TD wo - np) 




TM3 - 


(34 ) 


YATTACFYVATTAgT.rFTTFTATjTAgAWT.gFT.MrPFTVTT^WT / CFO TH WO ■ ^ 0Q\ 
x ax ^ au c x v/tx iiwxiWf x x ox/ujiiunnxjgr xjxiv^x^ r x v x xjXi x \ j&y xu xml/ . x ^ 3 1 




TM3 - 


(35) 


YAIFVLYATACFYVATTTFTAT.TSAWT^gFT^rpFTVTT^N ( CFO TD wo .nn) 
•*»x * » xj lAi/tcr x v/ix lid xjtvxj x Jnn xjo f xjxi^it r x v x xjx« x \ Od^ x u vt\j , i J u J 


15 


TM3 - 


(36 ) 


YAIFVLYASAWTjCFTJ^APFYTrTjFAnvrFT.TPFTVTT.NT f CFO TD WO - 1 "3 1 1 
***x r v xj i/wnn xjo f xjxv\^f x x ^> xj c mvj v l r juirr xv x ' x v o XU iiu . l J X ) 




TM3 - 


( 37 ) 


YATFVTiYACAWWAr'FYTOT.FArtVMFT.TT CFT WPPFTVTT WT ( CFO Tn MO - t *3 o \ 
XirkXir v XJl/witniiMUr X X v_ Xj r Mu V I*XT XiXXjOT XxMl~Jr r X v X xxU X lody XU INL/.XJ^./ 




TM3 - 


/ -a q \ 
\ J o t 


YA TF\rT VrVTTT P2if2170FT T T\ C MJT C UT W^DPTIH'T MT / PTA Tr> wr\ t -5 n \ 

xMXr vXjX r I XL-Ur Au VLf UXAoAnUor lxNL.rr X v i XxLNX \ oay XU NU : 1 j J ) 




X i. JO * 




lAlr VLiAaVLJAVNMf ibAWLbr LNLPr IVTLTI1 taEQ ID NO : 134; 




1 X*XJ 




xAXr o vUAVWx^ I vijiAiAWijor ID NO:135) 


2 0 




} 


lAlr VUiAoAWUS VlJAVNr^ xS FIjjNCPFI vTLNI (SKQ ID NO : 13 6 ) 




xru 


\ ) 


XAxr VUXAaAWlibr JaNSVOAvwrll' I PF I v rjjNI (SEQ ID NO:137) 




TM1 _ 
1 1x3 




xAxr VxjlAiAWIjbFxxNCPFIVS VUAVNBIFTTLNI \ SEQ ID NO: 13 8) 




X J7U 




XAxr VijXA^aAnijSVlJKU* iSFIiNCPFI v ixJ«I (SEQ ID NO: 139) 








VRT TT\ A T rXTtUf PT TT V A O AWT PPT MPnPTTJ*FT »TT / C TTT\ TA « i n \ 

I AI b VUAVN^i r VL i AoAWLS r LNL PF I vi LNI (SEQ ID NO: 140) 


25 


TM3 - 




VR T1TCT Cin?OT T TV TT7T VA O AtiTT CPT MPBrTTJTT MT / C T*r\ TA vr/*> i j« t \ 

* Air oxjo Vr oXxLjAX VIjI AoArTLbr XjNLPrlVxIxNI (SEQ ID NO:141) 




TM3 - 


f4 7 1 


V7A T CTrr vi\ct ctttj ct tat c a in en MroPT^ri'i mt / ep^ tts ma i >i *i \ 
x Air vXji AoLovr oi^JjAIbAWLbr IxNuPr I VTjUNI ( bEQ ID NO:142) 




TM3 - 


f 4 fl 1 
v •* O J 


VTATTTTfT VTA C7AUT CT CTiTTCT TAT OPT HnBPTtJTT WT ( OPO TH i >• -\ \ 

xAXr VLiAaAWiibxio Vr oixLAXbr UNUPr I VTLNI 1 bEQ ID NO:143) 




TM3 - 




YATFVT.VAC TAUT CTTT CT CT7T7CT T R TWOTJfT^TTT WT / CPH TT» Ktr\ . T A a \ 

xAxr vxjXAoAnijor UoXtb Vr bxxUAXJNUirr X v xIjlINI tb&<^ XU NU:X44/ 




T143 - 




YTATFT7T V7ACTAWT CPT MTJPCT CTJTCT TAT TT PT ' T XTT / PPrt Tn Wrt . i ji C\ 

* Air v JjIAoAniior ixMirr bUb Vr bxxLjAXX v x ixNX \ ba^ ID NO: 14b) 


30 


TM3 - 




x Axr viiXAXAVfijX r XjJXl. v xax Xirr X v i XjINX \ bby XU NU: 14b] 




TM3 - 




iAir vxiXAXAwixor Xjxnl. l bbx v v xAl x v I lxNi \ bay ID NU:147/ 




TM3 - 




XAxr vuxAXAWUbr iJN vxlxNlLTl TTI V ( SEQ ID NO: 14 8) 








iAXr Vx^xATAWljTFxJirATILNIJlFI (SEQ ID NO : 149 ) 




*TM3 - 




VTA T TT V AT^ATiTT OPT TkTVTA TTT XTT TJPTTWT WT IPPA TA »Tf"\ ^ c \ 

i Axr vljiAXAWxjbr XjxxMAriXiNIjPF I VTIjIMI \ SEQ ID NO: 150) 


35 


TM3 - 


(56) 


YAI FVX. YATAWLTFLNS G I lxIxlAP F I VTLN I ( S EQ ID N0:151) 




TM3 - 


f ^7 ) 


YATFVT VAC RUT TFT WMTV^TT T T TAOPTTTTT WT icrn Tn mn , t c*)l 

xAxr vXiXAoAnXix r Xxlnfi ioi ijXjixAjr r xvi IjIN l ( bhtQ XU NU : lbx ) 




TM3 - 




VA TPT7T VTA C A UJT TTT MTPT T*T rVTT <nrPrj PTT7TT MT i CPA rn vr/~\ i c- -s \ 
IAXr VliXAbAnijl r IxN Xc.1j1 V X livl VLr r 1 VTLiNI ( bEQ ID NO.153) 




TM3 - 


( ^9 } 


VA T T7\7T VA C AUT TCT WNt*T*PT TTrVTT TT TO m TTT MT / Cm TA i c >i \ 

i Axr vxjXAoawui r xxwrai txjivx ixjiVirr xv ix^INX v bc»y XU NU : 154 ) 




TWO 


( 60 ) 


YAI FVLYATAWIATE LTVYTXTVTFLNC P F I VTLNI (SEQ ID NO:155) 


40 


TM3- 


(61) 


YAI F*VLYASAVTIxATELS VTTLTW (SEQ ID NO: 156) 




TM3- 


(62) 


YAIFVLYATAVaSFIATELSVYASELS^ wn. ic ^ 




TM3- 


(63) 


YAIFVLYATAWLSFI^GGELALWTXCPFIVTLNI (SEQ ID NO: 158) 




TM3- 


(64) 


YAIFVLYASAWXTFIxNGGEIALWTLI VTLNI (SEQ ID NO: 159) 




TM3 - 


(65) 


YAI FVLYAS AWLGGE I ALWTLNC P F I VTLNI (SEQ ID NO: 160) 


45 


TM3 - 


(66) 


YAI FVLYAGGE I AL WTLS FLNC P F I VTLNI (SEQ ID NO: 161) 
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TM3- 


(67) 


YAI FVLYATAWLSFFFLLFGYLGNFLLNCPFI VTLNI (SEQ ID NO: 162) 




TM3- 


(68) 


YAIFVLYATAWLFFFLLFGYLGNFLLPFIVTLNI (SEQ ID NO: 163) 




TM3- 


(69) 


YAI FVLYATAWLTFLNTACFYVAITASLCF ITE IALI P F I VTLNI ( SEQ ID NO: 164 ) 




TM3- 


(70) 


YAI FVLYATAWLTACFYVAITATLCFITE IALI CP F IVTLNI (SEQ ID NO: 165 ) 


5 


TM3- 


(71) 


YAIFVLYATACFYVAITATLCFITEIALISFLNCPFIVTLNI (SEQ ID NO: 166) 




TM3- 


(72) 


YAITAC FYVAI TAS I*CF ITE IALIATAWLTFLNCP F I VTLNI (SEQ ID NO: 167) 




'PM3- 


(73) 


YAI FVLYATACFYVAI I TE IALITAWLTFLNCP F I VTLNI (SEQ ID NO: 168) 




TM3- 


(74) 


YAI FVLYAS AWLTFLNACFYI CLFAGVCFLI P F IVTLNI (SEQ ID NO : 1 6 9 ) 




TM3- 


(75) 


YAI FVLYASAWHACFYICLFAGVMFLILTFLNCPF IVTLNI ( SEQ ID NO: 170) 


10 


TM3- 


(76) 


YAI FVLYFYI CLFAGVCFLIATAWLTFLNCP FI VTLNI ( SEQ ID NO : 1 7 1 ) 




TM3- 


(77) 


YAI FVLYATVDAVNMFTTAWLTFLNCPFI VTLNI (SEQ ID NO: 172) 




TM3- 


(78) 


YAI FTVDAVNMFTVLYATAWLTFLNCP F IVTLNI ( S EQ ID NO: 173) 




TM3- 


(79) 


YAI FVL YATAWLTVDAVNMFTS FLNCPF I VTLNI ( S EQ ID NO: 174) 




TM3 - 


(80) 


YAI FVLYATAWLSFLNTVDAVNMFTPFI VTLNI (SEQ ID NO: 175) 


15 


TM3- 


(81) 


YAIFVLYASAWLTFLNCPFIVSVDAVNMFTTLNI (SEQ ID NO: 176) 




TM3- 


(82) 


YAI FVLYATAWLSVDMFTTFLNCP F IVTLNI ( S EQ ID NO: 177) 




TM3- 


(83) 


YAI SVDAVNMFTFVLYATAWLSFLNCPFI VTLNI (SEQ ID NO: 17 8) 




TM3- 


(84) 


YAIFVLYASLTVFSLLAISAWLTFLNCPFIVTLNI (SEQ ID NO: 179) 




TM3- 


(85) 


YAI FVLYASAWLTLS VFTLLAI S FLNCP F IVTLNI ( SEQ ID NO: 180) 


20 


TM3- 


(86) 


YAI FVL YASAWLTFLS LSVFTLLAINCP F IVTLNI ( S EQ ID NO: 181) 




TM3- 


(87) 


YAIFVLYASAWLTFLNPFSLSVFSLLAI IVTLNI (SEQ ID NO:182) 




TM3 - 


(88) 


YAI FVL YASAWLSFLNLGGVTASFTASVGPFI VTLNI (SEQ ID NO: 183) 




TM3- 


(89) 


YAI FVLYAS AWLS FLNLGGVTAS FTAS VGVTLNI (SEQ ID NO: 184) 




TM3- 


(90) 


YAI FVLLGGVTAS FTASVNYASAWLS FLNCP FI VTLNI ( SEQ ID NO : 1 8 5 ) 


25 


TM3- 


(91) 


YAIFVLYAIFFFLLFSAWLSFLNCPFIVTLNI (SEQ ID NO: 186) 




TM3- 


(92) 


YAI F"VLYASAWLS FLNCPFI VTLNI I FFFLLFI VTLNI (SEQ ID NO: 187) 




TM3- 


(93) 


YAI FVLYASAWI FFFLLFLS FLNCP F IVTLNI ( SEQ ID NO: 188) 




TM3- 


(94) 


YAIFVLYASAV^FFTVLASELSVYTLTVSFLNCPFIVTLNI (SEQ ID NO: 189) 




TM3- 


(95) 


YAI FVL YASAWLS F LFATLGGE IALC P F IVTLNI ( SEQ ID NO: 190) 


30 


TM3- 


(96) 


YAI FVLYAFATLGGEIALSAWLS FLNCPF IVTLNI (SEQ ID NO: 191) 




TM3- 


(97) 


YAI FFTV1ASELSVYTLTVYASAWLS FLNCPF IVTLNI (SEQ ID NO: 192) 




TM3- 


(98) 


YAIFFPIAALFASIASAWLSFLNCPFIVTLNI (SEQ ID NO: 193) 




TM3- 


(99) 


YAIFVLYASAWLSFFPIAALFASIPFIVTLNI (SEQ ID NO: 194) 




TM3- 


(100] 


YAIFVLYASAWLSFLNCPFFPIAALFASILNI (SEQ ID NO: 195) 


35 


TM3- 


(101 


YAI FVLYASAWLSLDVLFSTASIMHLSFLNGGEIALWS LI VTLNI ( SEQ ID NO: 196) 




TM3- 


102) 


YAIFVLYASLDVLFSTASIMHLIALWSLNCPFIVTLNI (SEQ ID NO:197) 




TM3- 


103) 


YAIFVLYAGGEIALWSLSFLNSLDVLFSTASIMHLPFIVTLNI (SEQ ID NO: 198) 




TM3- 


104) 


YAI FVLYASAWLSFFDVLFSTASIMHLFGYLGNFLLNCPF IVTLNI (SEQ ID NO: 199) 




TM3- 


105) 


YAI FVL YASAWLFFFLLFGYLSLDVLFSTASIMHLGNFLLPFI VTLNI (SEQ ID NO: 200) 


40 


TM3- 


106) 


YAI FVLYASAWLSFLNTACFYVAITASLSLMHLFITE IALI PFIVTLNI ( SEQ ID NO: 201) 




TM3- 


107) 


YASLDVLFSTAIMHLSAWLTACFYVAITASLCFITE IALI CP FI VTLNI ( SEQ ID NO: 202) 




TM3- 


108) 


YAI FVLYATACFYVAI TAS LS FLNCPF IVTLNI SLDVLFSTASIMHL ( SEQ ID NO: 203) 




TM3- 


109) 


YAITACFYVAITASLCFITEIALIASAWLSFLNCPFIVTLNI (SEQ ID NO: 204) 




TM3- 


110) 


YAI FVLYATACFYSTAS I LNLIMHLCAI S LVAI ITE IALI S AWLS FLN ( S EQ ID NO: 205) 


45 


TM3- 


111) 


YAI FVLYASAWLSFLNACFYICLFASILNLIMHLGVCFLI PFIVTLNI OSEQ ID NO: 206) 



> 
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TM3-I112) YAI FVLYASAWNAS I LNLIMHLCFYI CLFAGVMLI LS FLNC P F I VTLNI ( S EQ ID NO:207) 

TM3-I113) YAI FP FVQ CWS I FS LVL IAWLYF Y I AGVC F L I AS AWLS FLNC P F I VT I ( S E Q ID NO:208) 

TM3-(114) PFVQCVSITVSIFSLVLIAVYAIFVLYASVDAVNMFTSAWCPFIVTLNI (SEQ ID NO:209) 

TM3-U15) YAIFGDWSSVDAVNMFTVLYASAWLSFLNCPFIVTLNI (SEQ ID NO:210) 

5 TM3-(116) YAIFVLYAGDWSSAWLSVDAVNMFTSFLNCPFIVTLNI (SEQ ID NO: 211) 

TM3-(117) YAIFVLYASAWLGDWSSFLNSVDAVNMFTPFIVTLNI (SEQ ID NO:212) 

TM3-(118) YAIFVLYASAWLSFLNCPFIVGDWSSVDAVNMFTTLNI (SEQ ID NO: 213) 

TM3-(119) YAIFVLYASAWLGYLGSVDMFTSFLNCPFIVTGDWSLNI (SEQ ID NO: 214) 

TM3-U20) YAI S VDAVNMFTFVLYAGYLGSAWLS FLNCP F I VTLNI ( S EQ ID NO:215) 

10 TM3-U21) YAIFSLSVFSLLAIVLYASAWLGYLGSFLNCPFIVTLNI (SEQ ID NO:216) 

TM3-(122) YAI FVL YAG YLGAGNMD SLSVFSLLAISAWLS FLNCP F I VTLNI ( SEQ ID NO:217) 

TM3-(123) YAIFVLYASAWLSLSVFGNMSLLAISFLNCPFIVTLNI (SEQ ID NO:218) 

TM3-{124) YAI FVLYASAWLSFLSLSVFGGSLLAINCPFI VTLNI (SEQ ID N0:219) 

TM3-(125) YAI FVLYAS AWLS FLNPFSLSVFGSLLAI I VTLNI (SEQ ID NO:220) 

15 TM3-U26) YAI FVLYATAWLTFLS LANCVTATI PFI VTLNI ( S EQ ID NO:221) 

TM3-(127) YAI FVLYATAWLSFLNCTS LASS I WTATI VTLNI (SEQ IDN0:222; 

TM3-U28) YAI FVL YATAWLS FLNVTLNI S LACTTT I V (SEQ ID NO:223) 

TM3-U29) YAIFVLYATAWLTFLNTATILSLANLMFIVTLNI (SEQ ID NO:224) 

TM3-(130) YAI FVL YATAWLS FLNMAT I LNLP F S VDAVI VTLN I ( S EQ ID NO:225) 



20 Recently discovered G-proteins also can be used according 

to the presently claimed invention to provide GPR polypeptides of the 
present invention, based on the teaching and guidance presented 
herein. Exampled of such GPR polypeptides of the present invention 
may include, as non- limiting examples, GPR polypeptides corresponding 

25 to transmembrane domain III, e.g., as follows: 



TM3-(131) I STMYTVTGRWTLGQWCDFWLS SD ITCCTAS ILHLCVIAL (SEQ ID NO:226) 

TM3-U32) ILYGYRWPLPSKLCAVWIYLDVLFSTASIMHLCAISL (SEQ ID NO:227) 

TM3-U33) IIYI VMDRWKLGYFLCEVWLSVDMTCCTCS ILHLCVIAL (SEQ ID NO:228) 

TM3-U34) IADKTVRVAMGAENDLGYNFRSDDVCGHCWQWYCSL (SEQ ID NO: 229) 

30 TM3-(13S) ILNYWPFGLALCHFVNYSQAVSVLVSAYTLVAISI (SEQ ID NO:230) 

TM3-U36) I LGRWE FGIHLCKLWLTCDVLCCTS S ILNLCAIALD (SEQ ID NO:23l) 

TM3-(137) IMASVMHRHCLPLIGICLSSERHCLVSIFVELGAL (SEQ ID NO: 23 2) 



Further non- limiting examples of consensus GPR polypeptides 
for transmembrane domain III of several or many, such as 1-500, or 
35 any range or value therein, more recently discovered G-protein 
rprpprnr s are as followo : ■ 



TM3-U38) YAIFVLYASAWLSFLNCPFISILHLCVIALVTLNI (SEQ ID NO:233) 
TM3-C139) YAI FVLYATAWLS FLNCP FIS ILNLCAIALD VTLNI (SEQ ID N0:234) 
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TM3-(140) YAIFVLYATAWLTFLNCPFISIFVELGALVTLNI (SEQ ID NO:235) 

TM3-U41) YAI FVLYASAWLTFLNC P F I S I FVE LS IMHLCAI SLGALVTLNI (SEQ ID NO:236) 

TM3-(142) WAI FVLYAILGRWEFG IHLCKLWLTSAWLS IMHLCAIS LS FLNCP F I VTLNI ( SEQ ID NO: 237} 

TM3- (143) WAI FVLYAI LGRWEFG IHLCKLWLTTAWLS IMHLCAI S LS FLNCP F I VTLN I ( SEQ ID NO: 238) 

5 TM3-<144) WAI FVLYATAWLTFLNCPFS IMHLCAISLIVTLNI ( SEQ ID NO:239> 

TM3-<145) WAI FVLYASAWLTFLNCPFIS IMHLCAI SLVTLNI (SEQ ID NO:240) 

TM3-(146) YAVFVLYASAWLS FLNMS IMHLCAI SLP F IVTLNI ( SEQ ID NO:241) 

TM3-(147) YAVFVLYATAWLSFLNMPFSILNLCAIALDIVTLNI (SEQ ID NO:242) 

TM3-(148) YAVFVLYATAWLS I LNLCAIALDTFLNMPF I VTLNI (SEQ ID NO:243) 

10 TM3-U49) YAVFVLYAS IliNLCAIALD SAWLTFLNMP F IVTLNI (SEQ ID NO:244) 

TM3-U50) YAI FVLYASAWLS FLNCVTAS I PFCLVS I FVELGALI VTLNI (SEQ ID NO:245) 

TM3-(151) YAI FVLYASAWLS FLNCLVS I FVELGALI WTAS IVTLNI (SEQ ID NO:246) 

TM3-(152) YAI FVLYASAWLS FLNVTLNCLVS I FVELGALI I (SEQ ID NO:247) 

TM3-(153) YAI FVLYASAWLS FLNTAS I IiNLMF I CLVS I FVE LGALVTLNI (SEQ ID NO:248) 

15 TM3-U54) YAI FVLYASAWLS FLNMAS I LNL PFCLVS I FVE LGALVTLNI (SEQ ID NO:249) 

TM3- (155) YAI FVLYASAWLS FLNI LGRWE FG IHLCKLWLTCD VLCCTS S G I LLLAP F I VTLNI (SEQ ID NO:250) 

TM3- (156) YAIFVLYASAWLSFLNMILGRWEFGIHLCKLWLTCDVLCCTSSGILLLAPFIVTLNI (SEQ ID NO: 251 ) 

TM3- (157) YAI FVLYASAWLILGRWEFGIHLCKLWLTCDVLCCTSSFLNSELSVYTLTVCPFI VTLNI ( SEQ ID 
NO:252) 

20 TM3- (158) YAI FVLYAI LGRWE FG I HLCKLWLTCD VLCCTS SAWLS FLNMSELS VYTLTVP F IVTLNI (SEQ ID 
NO: 253) 

TM3-Q59) YAIFVLYASAWLASRWPLPLSVYTLTVSFLNCPFIVTLNI (SEQ ID NO:254) 

TM3 - ( 160 ) YAI FVLYASAWLASELI LYYWRWPLPCLHDLVWLCTCS IIjHLCVIALS V CTLTVPF IVTLNI (SEQ ID 

NO:255) 

2 5 TM3 - ( 161 ) YAI FVLYASAWLS FLASELSVYASELSSTLHDLVWLWLDWCVTALTI (SEQ ID 

NO:256) 

TM3 -(162) YAI FVLYASAWLS FLNGGE I ALWS LC P F 1 1 LYYWRWPLPCLHDLVS I LHLCVIAL VTLNI ( SEQ ID 

NO:257) 

TM3-U63) YVWLWLDWCCTCS ILHLCV'IALFVLYASAWLSFLNGGEIALWSLIVTLNI ( SEQ ID NO: 258) 

3 0 TM3-(164) YAI FVLYASAWLAI I LYYWRWP LPCLHDLGGE IALWS LNCP F IVTLNI ( S EQ ID NO: 259) 

Non- limiting examples of consensus GPR polypeptides for 
domain V across several or many, such as 1-500, or any range or value 
therein, G-protein receptors are as follows: 

TM5- (1) CDVFVFVDIMLCTASIFNLCAISVG (SEQ ID NO:260) 

35 TM5- (2) YAI FVLYD IMLCTAS I FNLCAI SVG ( S EQ ID N0:261) 

TM5-(3) D YAI FVFVD IMLMTAS I FNLMAI SVG ( S EQ ID NO:262) 

TM5- (4) DYAI FVFVD IMLHTTASTI FNLMATITVG (SEQ ID N0:263) 

TM5-(5) CDVAWYSSDIMLFYVCTASIFSSNLCAISSVG(SEQ ID N0:264) 

TM5-(6) FLFCSLGSF YIP I AVI LVD IMLCTAS I FNLCAI SVG (SEQ ID NO:265) 

40 TM5-(7) YAI FVLYDFLFCSLGSFYIPIAVILIMLCTAS I FNLCAI SVG ( SEQ ID NO:266) 

TM5-(8) DYAI FVFVD IMLMTAS I FLFCSLGS FY I PIAVTLISVG (SEQ ID NO:267) 

TM5-(9) DYAI FVFVD IMLHTTASTI FNLMAFLFCS LGS FYI P I AVILTITVG ( SEQ ID NO:268) 
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CDVAVVYSSDIMLFYVCTASIFSSNLFLFCSLGSFYCAISSVGfSEQ ID NO: 269) 
CDVFVFVDIMLCTASIFNWILSSIGSFFAPCLIl^VYLLCAISVG (SEQ ID NO: 270} 
YAIFVLYDIMLCTASIFNUJlIWyiLSSIGSFFAPCLILLVYLSVGtSEQ ID N0:27l) 
DYAIFVFVDIVnfILSSIGSFFAPCLIl^VYIASIFm^lAISVG(SEQ ID NO: 272) 
DYAIWILSSIGSFFAPCLILLVYLIMLHTTASTIFNLMATITVG (SEQ ID NO: 273) 
CDVAWYSSDIMLFYVCWYILSSIGSFFAPCLILLVYLSSNLCAISSVG (SEQ ID NO : 2 74 ) 
CDVFVFVDIMLCTASIFWYVISSSIGSFFAPCLIl^VYNL^ ID NO:275) 
YAIFVLYDIl^CTASIFm,CAIWYVISSSIGSFFAPCLINHLVYSVG(SEQ ID NO:276) 
DYAIFVFVWYVISSSIGSFFAPCLIl^V^ ID NO: 277) 
DYAIFVFVDIMLHTTASTIFWYVISSSIGSFFAPCLINHLVYTVG (SEQ ID NO:278) 
CDVAVVYSSDIMLFYVCTASIFSWYVISIGSFFAINHLVYNLCAISSVGfSEQ ID NO: 279) 
CDVFVFVD IMLCTAS I FNLCAITYAI SSSVISFYI P VAI LVTYT (SEQ Ti, NO: 280) 
YAI FVL YD IMLCTATYAI S S SVT S F Y I P VAI LVTYTS I FNLCAI S VG ( S E Q ID NO: 281) 
DYAIFVFVDIMLrTTATYAISSSVISFYIPVAILVTYTISVG(SEQ ID NO:282) 
TYAI S S S VI S FY I P VATD YAI FVFVD IMLHTTAST I FNLMAT I TVG (SEQ ID NO:283) 
CDVAVVYSSDI1^FYVCTATYAISSSVISFYIPVAILVTYTSSVG(SEQ ID NO: 284) 
CDVFVFVDFVIYSSWSFYLPFGVTVXVTACTASIFNLCAISVG (SE ID NO: 285) 
YAI FVLYDFVI YS S WS FYLPFGVTVLVYAS I FNLCAI SVG (SEQ ID NO:286) 
D YAI FVFVD FVT YS SWS F YLP FGVTVLVYATAS I FNLMAI SVG (SEQ ID NO: 287) 
D YAI FVFVD FVIYSSWSFYLPFGVTVLVYAHT^ (SEQ ID NO:288) 
CDVAVVYSSDFVTYSSWSFYLPFGVTVYVCTASIFSSNL ID NO: 289) 
CDVFVFVD IMLCTASYTIYSTCGAFYIPSVLLIILYGNLCAISVG (SEQ ID NO: 290) 
YAIFVLYDIMLCTASYTIYSTCGAFYIPSVLLIILYGNLCAISVG ( SEQ ID NO: 291) 
DYAI FVFVD IMLWTAS YTI YSTCGAFYI PSVLLI ILYGNLMAI SVG ( S EQ ID NO: 292) 
DYAIFVFVDIMiHTTASYTIYSTCGAFYIPSVLLIILYGMATITVG ( SEQ ID NO: 293) 
CDVAWYSSDIMSYTIYSTCGAFYIPSVLLIILYGIFSSNLCAISSVG(SEQ ID NO: 294) 
CDVFVF FVL I GSFVAVD IMLCTAS I FNLCAI SVG (SEQ ID NO: 295) 
YAI FVL Y FVL I GSFVAD IMLCTAS I FNLCAI SVG (SEQ ID NO: 296) 
DYAI FVFVF^IGSFVADIMLrnAS I FNLMAI SVG (SEQ ID NO: 297) 
DYAI FVFVFVLIGSFVAD IMLHTTAST I FNLMATITVG (SEQ ID NO: 298) 
CDVAWYSSFVLIGSFVADIMLFYVCTASIFSSNLCAISSVG (SEQ ID NO: 299) 
CX)VFVFVDI^CFFIPTLIMVITYFNLCAISVG (SEQ ID NO: 300) 
YAI FVLYDIMLCFF I PTLIMVTTYF FNLCAI SVG (SEQ ID NO: 301) 
DYAI FVFVD IML^ITF I PTLIMVITYFNLMAI SVG (SEQ ID NO: 3 02) 
DYAI FVFVD IMIJn'FF I PTLIMVTTYFNLMATITVG (SEQ ID NO: 3 03) 
CDVAVVYSSDI>1LFYVCFFIPTLI>IVITYFSSNLCAISSVG(SEQ ID NO: 304) 
CD VVYGLVDGLVTFYLPLLIMCITYYD IMLCTAS I FNLCAI SVG (SEQ ID NO: 305) 
YAIVYGLVDGLVTFYLPLLIMCITYYD IMLCTAS I FNLCAI SVG (SEQ ID NO: 306) 
DYAIVYGLVDGLVTFYLPLLIMCITYYDIMLMTAS I FNLMAI SVG ( SEQ ID NO:307) 
DYAIVYGLVDGLVTFYLPLJjIMCISSDIMLHTTASTIFNLMATITVG ( SEQ ID NO: 308) 
CDVVYDGLVTFYLPXJJMrTTYYnT^ iu NU:jQ9) 



CDVFVFVD IMLLVI FLGLVI VI P FVLI IVSYAS I FNLCAI SVG ( SEQ ID NO:310) 
YAI FVL YD IMLLVI FLGLVI VT P FVLI I VS YAI FNLCAI SVG (SEQ ID NO : 311 ) 
DYAIFVFVDIMl^VIFl^LVIVIPFVLIIVSYAIFNLM^ ID NO:312) 

DYAI FVFVD IMLHTLVI FLGLVI VIP FVLI I VS YAI FNLMATITVG (SEQ ID NO: 313) 
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TM5-(55) CDVAWYSSDIMLFLVIFLGLVIVIPFVLIIVSYAIFSSNLCAISSVGtSrJQ ID NO: 314) 

IMS- (56) CDVTVFVDIMLCTALMIYII/^LIIIIPFIilVMSYVSIF^ ID NO:315) 

TM5- (57) YAIFVLYDIMLCTAI^YIUKSLIIIIPFIJ.IVMSYVSIFNLCAISVG (SEQ ID NO: 3 16) 

TM5-(58) DYAIFVFVDIMIOTASIFNLMIYII^LIIIIPFLL^^ ID NO: 317) 

TM5-(59) DYAIFVTVDIMLHTTASTILMIYIIjGGLIIIIPFLLIVMSYVTTVG (SEQ ID NO:318) 

TM5-(60) CDVAVVYSSDIMLFYVCTAYILGGLIPFLLIVMTYVSIFTNLCAISSVG (SEQ ID NO:319) 

TM5-(61) CDVFVFVDIMLCTASIFNIJiMIHIMEVTIIVIPF^IVTSYACAISVG(SEQ ID NO:320) 

TM5-(62) YAI FVLYD IMLCTAS I FNLLMIHIMEVT I IVIPFVLIVT S YACAI SVG ( SEQ ID NO:321) 

TM5-(63) DYAIFVFVDIMLOTASIFLMIHIMEVIIIVIPFVLIVISYAISVG (SEQ ID NO:322) 

TM5-(64) DYAIFVTVDIMLHTTASTILMIHIMEVT I IVI P FVLIVT S YAITVG ( SEQ ID NO:323) 

TM5-(65) CDVAVVYSSDIMLFYVCTASIFLOTHIMEVIIIVTPFVLIVISYAAISSVG(SEQ ID NO:324) 

Non- limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 

T M l ( 1 ) 

TMINWPALSIWIIINTIGGNILVIMAVSIYT^ 

IGYVCSSSLGINPVIIYTLFtSEQ ID NO: 325) 

T M 1 - ( 2 , 

NWPALSIWIIINTIGGNILVIMAVTI 

VCTTTLGINPVTIYTLF (SEQ ID NO: 326) 

T M 1 - ( 3 ) 

NTWPALTIVVIIINTIGGNILVIMAVSIYTTIJDVMLCTATILNL 

VCSTSLGINPVIIYTLF(SEQ ID NO: 327) 

T M ! . ( 5 } 

NWPALTIVVIIINTIGGNILVIMAVTIYTTLDVMLCTATILNLLITLFVLIGT 
VCTLGINPVIIYTLF (SEQ ID NO: 328) 

T M I" - ( 6 ) 

NWKNWSALLTIVVIILTIAGNILVIMAVSSUJVMLCTASILNLL^ 

VCSSSLGINPVIIYTLF(SEQ ID NO: 329) 

T M 1 - ( 7 , 

ITITWIAVLILITVAGNVWCIAVGSira 

YVCSSSLGINPVIIYTLFtSEQ ID NO: 330) 

T M 1 " ( 8 ) 

TLTLVCIACLUSLTVFGNVL^ 

SSLGINPVIIYTLF(SEQ ID NO: 331) 

T M x . ( 9 ) 

TAAIAAAITFLILFTIFGNALVIIAVLSIYTSIJJVl^CTASIIiNLLISLFVLIGSFVAFFIPLTIMV 
GYVCSSSLGINPVIIVTLF(SEQ ID NO: 332) 

T M 1 -(10, 

AISVGLVLGAFILFAIVGNILVILSVANWP^^ 

FVAFFIPLTIMV1TYFLFNVFFVWIGYVCSSSLGINPV1IYTLF (SEQ ID NO: 333) 
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T H 1 til) 

AAIJIGAIJJU^VLATVGGNLLVIVAIASLDVMLCT 

SS SLGINPVIIYTLF (SEQ ID NO: 334) 

T M 1 - (12) 

5 TAGDCLIMLIVIXIVAGNVLVTVAISI^ 
SLGINPVIIYTLF (SEQ ID NO: 335) 

T M 1 - (13) 

VITIAVVTAWSIiMTIVGNVLVMISFSIYTSLDVMLCTASILNLLISLFV^ 
YVCSSSLGINPVIIYTLF(SEQ ID NO: 3 36) 
10 T M 1 - (1 4 ) 

MVFI ATVRGSLSLVTWGNI LVMLS I S I YTSLDVMLCTAS I LNLLISLFVLIGSFVAFFIPLTIMVITYFLFNVFFVWIG 
YVCSSSLGINPVIIYTLF(SEQ ID NO: 33 7) 

T M 1 (15) 

WFIAFLTGILALVTIIGNILVIVSFSIYTSHTVMLCTASIL^ 
15 VCSS SLGINPVIIYTLF (SEQ ID NO: 338) 

Non- limiting examples of longer consensus GPR polypeptides 
for domain V across several or many, such as 1-500, or any value or 
range therein, G-protein receptors are as follows: 

T m 3 - ( i 6 5 ) 

20 NWPALS I WI I INTIGGNILVIMAFFACFVLVLTQSSIFSLLAIAINLLISLFVLIGSFVAFFIPLT^ 
VWIGYVCSSSLGINPVIIYTLFtSEQ ID NO:339) 

T M 3 - ( 1 6 6 ) 

NWPALSI WI I INTIGGNILVIMAFFACFVLVLTQSS IFSL 
CSSSLGINPVIIYTLF(SEQ ID NO: 34 0) 
25 T M 3 - ( 16 7 ) 

NWPALSIWIIINTIGGNILVIMAVMVACPVLILTQSSIIALLAJ^^ 
LGINPVIIYTLF (SEQ ID NO: 341) 

T M 3 - ( i g 8 , 

NWPALS IVVI I INTIGGNILVIMAVLWLALDYVASNASVLNIiLLISFFFIPL 
3 0 PVIIYTLF(SEQ ID NO: 342) 

T M 3 - ( i 6 9 } 

NWPAXS I WI I INTIGGNILVIMAVLYWSNASVMNI^^ 
IIYTLF (SEQ ID NO:343) 

T M 3 - ( 1 7 0 ) 

3 5 NWPALSIWIIINTIGGNILVIMAVLWIAIDYVASNASVI^^ 
SLGINPVIIYTLF (SEQ ID NO: 344) 

T M 3 - ( i 7 i j 
^PA L. STWTTTNTIGGNILVIMAVLrprLQKggVQITVIilLCAIjSbSt' ^A^ FlPLTIMVlTYFLFNVFFV WIGYVCSSS 



LGINPVIIYTLF (SEQ ID NO: 345) 
40 T M 3 - ( 17 2 ) 

NWPALSIVVIIINTIGGNILVIMAVCITYLQYLGINASSCSITAPTIIG^ 
SSLGINPVIIYTLF (SEQ ID NO: 34 6) 
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T M 3 (17 3 ) 

NWPALS I WI I INTIGGNILVIMAVFHNFFP IAALFAS I YSMTAVAGS FVAFFIPLTIMVTTYFLFNVFFVWIGYVCS S S 
LGINPVIIYTLF (SEQ ID NO: 347) 

T M 3 - ( 17 4 ) 

NWPALS IWT I INTIGGNILVIMAVIASASVSFNLYAS VFLLTCLS IGS FVAFFIPLTIMVTTYFLFNVFFVWIGYVCS S 
SLGINPVIIYTLF (SEQ ID NO: 34 8) 

As another non- limiting, illustrative example of a GPR 
polypeptide consensus sequences across each individual or different 
transmembrane domains of 5-HT receptors may be made, such as for 5- 
HT, as the following: 

5HT consensus (4) KNASALLSVIIINSIGGNWTAVS (SEQ ID NO: 349); 
5HT consensus (5) YFLMSLAVTDLWSFVMPVSAL (SEQ ID NO: 350); 
5HT consensus (6) AITKIAITWAISGVSVPFIPVWG (SEQ ID NO: 3 51); and 
5HT consensus (7) LGIIFGTFIIIWLPFFITNLVSPI (SEQ ID N0:352); 

Wherein variations and substitutions of amino acids may be made as 
described herein. 

Alternatively, 5-HT consensus sequences may be 
provided as consensus peptides of the present invention as consensus 
peptides for individual transmembrane domains, such as 5-HT domains 
III, V and VII, e.g., as follows: 

5-HT consensus (8): IWISLDVLFSTASSIMHLCAISL (SEQ ID NO:353) ' 
S-HT consensus (9) : GYTI YSTLVTFYI PS VTMVITYG (SEQ ID NO:354) 
5-HT consensus (10) : LLNFFNWIGYLNSLINPVTYTLF (SEQ ID NO -.3 55) 

This invention is also directed to an antibody which binds 
an epitope specific for a GPR polypeptide of the present invention 
and the use of such an antibody to detect the presence of , or measure 
the quantity or concentration of, the GPR protein in a cell, a cell 
or tissue extract, a biological fluid, an extract thereof, a 
solution, or sample, in vitro, in situ, or in vivo. 
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The term "antibody" is meant to include polyclonal 
antibodies, monoclonal antibodies tmAbs) , chimeric antibodies, anti- 
idiotype (anti-Id) antibodies to antibodies specific for GPR 
polypeptide of the present invention, as well as fragments, consensus 
polypeptides or chemical derivatives thereof. 

Polyclonal antibodies are heterogeneous populations of 
antibody molecules derived from the sera of animals immunized with 
an antigen. 

A monoclonal antibody contains a substantially homogeneous 
population of antibodies specific to antigens, which population 
contains substantially similar epitope binding sites. MAbs may be 
obtained by methods known to those skilled in the art. See, for 
example Kohler and Milstein, Mature 256:495-497 (1975); U.S. Patent 
No. 4,376,110; Ausubel et al , eds . , Current Protocols in Molecular 
Biology, Wiley Interscience , N.Y., (1987, 1992); and Harlow and Lane 
Antibodies; A Laboratory Manual Cold Spring Harbor Laboratory (1988) , 
the contents of which references are incoporated entirely herein by 
reference. Such antibodies may be of any immunoglobulin class 
including IgG, IgM, IgE, IgA, GILD and any subclass thereof. A 
hybridoma producing a mAb of the present invention may be cultivated 
in vitro, in situ or in vivo. Production of high titers of mAbs in 
vivo or in situ makes this the presently preferred method of 
production. 

Chimeric antibodies are molecules different portions of 
which are derived from different animal species, such as those having 
variable region derived from a murine mAb and a human immunoglobulin 
constant region, which are primarily used to reduce immunogenicity 
in application and to increase yields in production, for example, 
where murine mAbs have higher yields from hybridomas but higher 
immunogenicity in humans, such that human/murine chimeric mAbs are 
used. Chimeric antibodies and methods for their production are known 
in the art (Cabilly et al , Proc. Natl. Acad. Sci . USA 81:3273-3277 
(19 84); Morrison et al . , Proc. Natl. Acad. Sci. USA 81:6851-6855 
(1984); Boulianne et al . , Nature 312 : 643 - 646 (1984): rah-Mlyot- ^\ t 
European Patent Application 125023 (published November 14, 1984); 
Neuberger et al . , Nature 314:268-270 (1985); Taniguchi et al . , 
European Patent Application 171496 (published February 19, 1985); 



WO 94/05695 



< 

PCT/US93/08528 



36 



Morrison et al . , European Patent Application 173494 (published March 
5, 1986); Neuberger et al., PCT Application WO 86/01533, (published 
_ March 13, 1986); Kudo et al . , European Patent Application 184187 
(published June H, 1986); Morrison et al . , European Patent 
5 Application 173494 (published March 5, 1986); Sahagan et al., J. 
Immunol. 137:1066-1074 (1986); Robinson et al . , International Patent 
Publication No .PCT/US86/02269 (published 7 May 1987); Liu et al . , 
Proc. Natl. Acad. Sci . USA 84:3439-3443 (1987); Sun et al . , Proc. 
Natl. Acad. Sci. USA 84:214-218 (1987); Better et al . , Science 

10 240:1041- 1043 (1988); and Harlow and Lane Antibodies: A Laboratory 
Manual Cold Spring Harbor Laboratory (1988)). These references are 
incorporated entirely herein by reference. 

An anti- idiotypic (anti-Id) antibody is an antibody which 
recognizes unique determinants generally associated with the antigen- 

15 binding site of an antibody. An Id antibody can be prepared by 
immunizing an animal of the same species and genetic type (e.g., 
mouse strain) as the source of the mAb with the mAb to which an anti- 
Id is being prepared. The immunized animal will recognize and 
respond to the idiotypic determinants of the immunizing antibody by 

20 producing an antibody to these idiotypic determinants (the anti- Id 
antibody). See, for example, U.S. patent No. 4,699,880, which is 
herein entirely incorporated by reference. 

The anti -Id antibody may also be used as an "immunogen" to 
induce an immune response in yet another animal, producing a so- 

25 called anti -anti -Id antibody. The anti -anti -Id may be epitopically 
identical to the original mAb which induced the anti -Id. Thus, by 
using antibodies to the idiotypic determinants of a mAb, it is 
possible to identify other clones expressing antibodies of identical 
specificity. 

30 Accordingly, mAbs generated against a GPR polypeptide of 

the present invention may be used to induce anti -Id antibodies in 
suitable animals, such as BALB/c mice. Spleen cells from such 
immunized mice are used to produce anti- Id hybridomas secreting anti- 
Id mAbs. Further, the anti- Id mAbs can be coupled to a immunogenic 

35 carrier such as keyhole limpet hemocyanin (KLH) or cationized bovine 
serum albumin and used to immunize additional BALB/c mice. Sera from 
these mice will contain anti -anti -Id antibodies that have the binding 
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properties of the original mAb specific for a GPR polypeptide 
epitope . 

The anti-Id mAbs thus have their own idiotypic epitopes, or 
"idiotopes" structurally similar to the epitope being evaluated. 
5 The term "antibody" is also meant to include both intact 

molecules as well as fragments thereof, such as, for example, Fab and 
F(ab*) 2 i which are capable of binding antigen. Fab and F(ab') : 
fragments lack the Fc fragment of intact antibody, clear more rapidly 
from the circulation, and may have less non-specific tissue binding 
10 than an intact antibody (Wahl et al . , J. Nucl . Med. 24:316-325 
(1983) ) . 

It will be appreciated that Fab and F(ab') : and other 
fragments of the antibodies useful in the present invention may be 
used for the detection and quantitation of a GPR polypeptide 

15 according to the methods disclosed herein for intact antibody 
molecules. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or 
pepsin (to produce F(ab') 2 fragments). 

An antibody is said to be "capable of binding" a molecule 

20 if it is capable of specifically reacting with the molecule to 
thereby bind the molecule to the antibody. The term "epitope" is 
meant to refer to that portion of any molecule capable of being bound 
by an antibody which can also be recognized by that antibody. 
Epitopes or "antigenic determinants" usually consist of chemically 

25 active surface groupings of molecules such as amino acids, lipids or 
sugar side chains and have specific three dimensional structural 
characteristics as well as specific charge characteristics. 

An "antigen" is a molecule or a portion of a molecule 
capable of being bound by an antibody which is additionally capable 

3 0 of inducing an animal to produce antibody capable of binding to an 
epitope of that antigen. An antigen may have one, or more than one 
epitope. The specific reaction referred to above is meant to 
indicate that the antigen will react, in a highly selective manner, 

with its corresponding anfjhndy anH nnf i.H 1-V1 t-hn muUir^u ■ ,£ ijLlllii 

35 antibodies which may be evoked by other antigens. 

The antibodies, or fragments of antibodies, useful in the 
present invention may be used to quantitatively or qualitatively 
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detect a GPR polypeptide in a sample or to detect presence of cells 
which express a GPR polypeptide of the present invention. This can 
be accomplished by immunofluorescence techniques employing a 
f luorescently labeled antibody (see below) coupled with light 
5 microscopic, flow cytometric, or fluorometric detection. 

The antibodies (of fragments thereof) useful in the present 
invention may be employed histologically, as in immunofluorescence 
or immunoelectron microscopy, for in situ detection of a GPR 
polypeptide of the present invention. In situ detection may be 
10 accomplished by removing a histological specimen from a patient, and 
providing the a labeled antibody of the present invention to such a 
specimen. The antibody (or fragment) is preferably provided by 
applying or by overlaying the labeled antibody (or fragment) to a 
biological sample. Through the use of such a procedure, it is 
15 possible to determine not only the presence of a GPR polypeptide but 
also its distribution on the examined tissue. Using the present 
invention, those of ordinary skill will readily perceive that any of 
wide variety of histological methods (such as staining procedures) 
can be modified in order to achieve such in situ detection. 

Such assays for a GPR polypeptide of the present invention 
typically comprise incubating a biological sample, such as a 
biological fluid, a tissue extract, freshly harvested cells such as 
lymphocytes or leukocytes, or cells which have been incubated in 
tissue culture, in the presence of a detectably labeled antibody 
capable of identifying a GPR polypeptide, and detecting the antibody 
by any of a number of techniques well-known in the art. See, e.g., 
Harlow and Lane, supra; Ausubel et al , supra : and Sambrook et al , 
supra . 

The biological sample may be treated with a solid phase 
support or carrier, such as nitrocellulose, or other solid support 
or carrier which is capable of immobilizing cells, cell particles or 
soluble proteins. The support or carrier may then be washed with 
suitable buffers, followed by treatment with a detectably labeled GPR 
polypeptide- specific antibody. The solid phase support or carrier 
3 5 may then be washed with the buffer a second time to remove unbound 
antibody. The amount of bound label on said solid support or carrier 
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may then be detected by known method steps, see, e.g., Harlow, supra ; 
Ausubel, supra : or Sambrook, supra . 

By "solid phase support", "solid phase carrier", "solid 
support", "solid carrier", "support" or "carrier" is intended any 
support or carrier capable of binding antigen or antibodies. Well- 
known supports or carriers, include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon amylases, natural and 
modified celluloses, polyacrylamides , gabbros, and magnetite. The 
nature of the carrier can be either soluble to some extent or 
insoluble for the purposes of the present invention. The support 
material may have virtually any possible structural configuration so 
long as the coupled molecule is capable of binding to an antigen or 
antibody. Thus, the support or carrrier configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of 
15 a test tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, polymer test strip, etc. 
Preferred supports or carriers include polystyrene beads. Those 
skilled in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain che same by use of 

2 0 routine experimentation. 

The binding activity of a given lot of anti-GPR polypeptide 
antibody -may be determined according to well known method steps. 
Those skilled in the art will be able to determine operative and 
optimal assay conditions for each determination by employing routine 
25 experimentation. See, e.g., Harlow, supra . 

Other such steps as washing, stirring, shaking, filtering 
and the like may be added to the assays as is customary or necessary 
for the particular situation. 

One of the ways in which a GPR polypeptide- specif ic 
antibody, anti - idiotype antibody or fragment thereof, can be 
detectably labeled is by linking the same to an enzyme and use in an 
enzyme immunoassay (EIA) . This enzyme, in turn, when later exposed 
to an appropriate substrate, will react with the substrate in such 
a manner as to produce a chemical moiety wh-irh n n h e tietcctad, foi 

3 5 example, by spectrophotometric , fluorometric or by visual means. 

Enzymes which can be used detectably label the antibody include, but 
are not limited to, malate dehydrogenase, staphylococcal nuclease, 
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delta- 5 -steroid isomerase, yeast alcohol dehydrogenase, alpha - 
glycerophosphate dehydrogenase, triose phosphate isomerase, 
" .horseradish peroxidase , alkaline" phosphatase , asparaginase, glucose 
oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose- 
5 6- phosphate dehydrogenase, glucoamylase and acetylcholinesterase. 
The detection can be accomplished by colorimetric methods which 
employ a chromogenic substrate for the enzyme. Detection may also 
be accomplished by visual comparison of the extent of enzymatic 
reaction of a substrate in comparison with similarly prepared 

10 standards. See, Harlow, supra , Ausubel, supra . 

Detection may be accomplished using any of a variety of 
other immunoassays. For example, by radioactivity labeling the 
antibodies or antibody fragments, it is possible to detect R-PTPase 
through the use of a radioimmunoassay (RIA) . A good description of 

15 RIA maybe ■ found in Laboratory Techniques and Biochemistry in 
Molecular Biology, by Work et al . , North Holland Publishing Company, 
NY (1978) with particular reference to the chapter entitled "An 
Introduction to Radioimmune Assay and Related Techniques" by Chard, 
incorporated entirely by reference herein. The radioactive isotope 

20 can be detected by such means as the use of a 7- counter, a 
scintillation counter or by autoradiography. 

It is also possible to label an anti-GPR polypeptide 
antibody, anti- idiotype antibody or fragment thereof, with a 
fluorescent compound. When the f luorescently laoeled antibody is 

25 exposed to light of the proper wave length, its presence can be then 
be detected due to fluorescence. Among the most commonly used 
fluorescent labelling compounds are fluorescein isothiocyanate, 
rhodamine, phycoerythrin , phycocyanin, allophycocyanin, 
o-phthaldehyde and f luorescamine, commercially available, e.g., from 

3 0 Molecular Probes, Inc. (Eugene, Ore.). 

The antibody can also be detectably labeled using 
fluorescence emitting metals such as 152 EU, or others of the 
lanthanide series. These metals can be attached to the antibody 
using such metal chelating groups as diethylenetriamine pentaacetic 

3 5 acid (EDTA) . 

The antibody also can be detectably labeled by coupling it 
to a chemiluminescent compound. The presence of the 
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chemiluminescent- tagged antibody is then determined by detecting the 
presence of luminescence that arises during the course of a chemical 
reaction. Examples of particularly useful chemiluminescent labeling 
compounds are luminol , isoluminol , theromatic acridinium ester, 
imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
the antibody of the present invention. Bioluminescence is a tyoe of 
chemiluminescence found in biological systems in which a catalytic 
protein increases the efficiency of the chemiluminescent reaction. 
The presence of a bioluminescent protein is determined by detecting 
the presence of luminescence. Important bioluminescent compounds 
for purposes of labeling are luciferin, luciferase and aequorin. 

An antibody molecule of the present invention may be 
adapted for utilization in a immunometric assay, also known as a 
15 "two -site" or "sandwich" assay. In a typical immunometric assay, a 
quantity of unlabeled antibody (or fragment of antibody) is bound to 
a solid support or carrier and a quantity of detectably labeled 
soluble antibody is added to permit detection and/or quantitation of 
the ternary complex formed between solid-phase antibody, antigen, and 
2 0 labeled antibody. 

Typical, and preferred, immunometric assays include 
"forward" assays in which the antibody bound to the solid phase is 
first contacted with the sample being tested to extract the antigen 
form the sample by formation of a binary solid phase antibody- antigen 
25 complex. After a suitable incubation period, the solid support or 
carrier is washed to remove the residue of the fluid sample, 
including unreacted antigen, if any, and then contacted with the 
solution containing an unknown quantity of labeled antibody (which 
functions as a "reporter molecule"). After a second incubation 
period to permit the labeled antibody to complex with the antigen 
bound to the solid support or carrier through the unlabeled antibody, 
the solid support or carrier is washed a second time to remove the 
unreacted labeled antibody. 

In another type of "san dwirh" assay, — which may alsu 



30 



35 useful, with the antigens of the present invention, the so-called 
"simultaneous" and "reverse" assays are used. A "simultaneous" and 
"reverse" assays are used. A simultaneous assay involves a single 
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incubation step as the antibody bound to the solid support or carrier 
and labeled antibody are both added to the sample being tested at the 
same time. After the incubation is completed, the solid support or 
carrier is washed to remove the residue of fluid sample and 
5 uncomplexed labeled antibody. The presence of labeled antibody 
associated with the solid support or carrier is then determined as 
it would be in a conventional "forward" sandwich assay. 

In the "reverse" assay, stepwise addition first of a 
solution of labeled antibody to the fluid sample followed by the 

10 addition of unlabeled antibody bound to a solid support or carrier 
after a suitable incubation period is utilized. After a second 
incubation, the solid phase is washed in conventional fashion to free 
it of the residue of the sample being tested and the solution of 
unreacted labeled antibody. The determination of labeled antibody 

15 associated with a solid support or carrier is then determined as in 
the "simultaneous" and "forward" assays. See, e.g., for the above- 
mentioned immunological techniques, Harlow, supra; Ausubel et al, 
supra; and Sambrook et al, supra . GPR polypeptides of the present 
invention can be made by chemical synthesis or by recombinant 

2 0 methods, wherein chemical synthesis is preferred. 

Synthetic — producti on of transmembrane proteins of i-hg 
present invention 

GPR polypeptides, variants and chemical derivatives thereof 
can be synthesized according to known method steps, including 
25 portions of known GPR transmembrane domains, consensus peptides 
thereof, conservative substitution derivative thereof or functional 
derivatives thereof. 

Chemical polypeptide synthesis is a rapidly evolving area 
in the art, and methods of solid phase polypeptide synthesis are 

3 0 well -described in the following references, hereby entirely 

incorporated by reference: (Merrifield, B., J. Amer. Chem. Soc. 
85:2149-2154 (1963); Merrifield, B., Science 232:341-347 (1986); 
Wade, J.D. et al., Biopolymers 25 :S21-S37 (1986); Fields, G.B., Int. 
J. Polypeptide Prot. Res. 35:161 (1990); MilliGen Report Nos . 2 and 
35 2a, Millipore Corporation, Bedford, MA, 1987) Ausubel et al, supra, 
and Sambrook et al . supra . 
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In general, as is known in che art, such methods involve 
blocking or protecting reactive functional groups, such as free 
amino, carboxyl and thio groups. After polypeptide bond formation, 
the protective groups are removed (or de-protected) . Thus, the 
5 addition of each amino acid residue requires several reaction steps 
for protecting and deprotecting. Current methods utilize solid phase 
synthesis, wherein the C- terminal amino acid is covalently linked to 
an insoluble resin particle large enough to be separated from the 
fluid phase by filtration. Thus, reactants are removed by washing 
10 the resin particles with appropriate solvents using an automated 
programmed machine The completed polypeptide chain is cleaved from 
the resin by a reaction which does not affect polypeptide bonds. 

In the more classical method, known as the "tBoc method, " 
the amino group of the amino acid being added to the resin -bound 
15 C- terminal amino acid is blocked with tert -butyloxvcarbonyl chloride 
(tBoc) . This protected amino acid is reacted with the bound amino 
acid in the presence of the condensing agent 
dicyclohexylcarbodiimide, allowing its carboxyl group to form a 
polypeptide bond the free amino group of the bound amino acid. The 
20 amino -blocking group is then removed by acidification with 
trifluoroacetic acid (TFA) ; it subsequently decomposes into gaseous 
carbon dioxide and isobutylene. These steps are repeated cyclically 
for each additional amino acid residue. A more vigorous treatment 
with hydrogen fluoride (HF) or trif luoromethanesulf onyl derivatives 
25 is common at the end of the synthesis to cleave the benzyl -derived 
side chain protecting groups and the polypeptide- resin bond. 

More recently, the preferred "Fmoc" technique has been 
introduced as an alternative synthetic approach, offering milder 
reaction conditions, simpler activation procedures and compatibility 
30 with continuous flow techniques. This method w.-s used, e.g., to 
prepare the peptide sequences disclosed in the present application. 
Here, the oc- amino group is protected by the base labile 
9-fluorenylmethoxycarbonyl (Fmoc) group. The benzyl side chain 
protecting groups are replaced b y rho mo^o .r.^ labile — L-butyl 



35 derivatives. Repetitive acid treatments are replaced by deprotection 
with mild base solutions, e.g., 20% piperidine in dimethylf ormamide 
(DMF) , and the final HF cleavage treatment is eliminated. A TFA 
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solution is used instead to cleave side chain protecting groups and 
the polypeptide resin linkage simultaneously. 

At least three different polypeptide- resin linkage agents 
can be used: substituted benzyl alcohol derivatives that can be 
cleaved with 95% TFA to produce a polypeptide acid, methanolic 
ammonia to produce a polypeptide amide, or 1% TFA to produce a 
protected polypeptide which can then be used in fragment condensation 
procedures, as described by Atherton, E. et al . , J. Chem. Soc. Perkin 
Trans. 1:538-546 (1981) and Sheppard, R.C. et al . , Int. J. 
Polypeptide Prot. Res. 20:451-454 (1982). Furthermore, highly 
reactive Fmoc amino acids are available as pentaf luorophenyl esters 
or dihydro-oxobenzotriazine esters derivatives, saving the step of 
activation used in the tBoc method. 

Sequences available to use as a basis for polypeptide 
synthesis can be based on published sequences of G-protein coupled 
receptors, ligands and/or effectors, wherein the transmembrane or 
functional domains correspond to sections of hydrophobic or other 
amino acids of 5 to 100 amino acids, such as 5-10, 10-15, 15-25, 
20-25, 23-27, 25-30, 28-35, 20-40, 10-40, 20-30, 30-40, 40-50, 10-80, 
20-60 or 25-40 amino acids in length. Recombinant production of GPR 
polypeptides can be accomplished according to known method steps. 
Standard reference works setting forth the general principles of 
recombinant DNA technology include Watson, j.d. et al., Molecular 
Biology of the Gene, Volumes I and II, The Benjamin/Cummings 
Publishing Company, Inc., publisher, Menlo Park, CA (1987); Darnell, 
J.E. et al., Molecular Cell Biology, Scientific American Books, Inc., 
publisher, New York, NY (1986); Lewin, B.M., Genes III, John Wiley 
Sc Sons, publishers, New York, NY (1989); Old, R.W. , et al., 
Principles of Gene Manipulation: An Introduction to Genetic 
Engineering, 2d edition, University of California Press, publisher, 
Berkeley, CA (1981); Ausubel et al, eds . , Current Protocols in 
Molecular Biology, Wiley Interscience, publisher, New York, NY (1987, 
1992) ; and Sambrook et al., Molecular Cloning: A Laboratory Manual , 
Second Edition, Cold Spring Harbor Laboratory, publisher, Cold 
Spring Harbor, NY (1989) , the entire contents of which references are 
herein incorporated by reference. 



o 

WO 94/05695 



o 

PCT/US93/08528 



- 45 - 

A nucleic acid sequence encoding a GPR polypeptide of the 
present invention may be recombined with vector DNA in accordance 
with conventional techniques, including blunt -ended or 
staggered -ended termini for ligation, restriction enzyme digestion 
5 to provide appropriate termini, filling in of cohesive ends as 
appropriate, alkaline phosphatase treatment to avoid undesirable 
joining, and ligation with appropriate ligases. Techniques for such 
manipulations are disclosed, e.g., by Ausubel et al , supra, and are 
well known in the art. 

10 A nucleic acid molecule, such as DNA, is said to be 

"capable of expressing" a polypeptide if it contains nucleotide 
sequences which contain transcriptional and translational regulatory 
information and such sequences are "operably linked" to nucleotide 
sequences which encode the polypeptide. An operable linkage is a 

15 linkage in which the regulatory DNA sequences and the DNA sequence 
sought to be expressed are connected in such a way as to permit gene 
expression as GPR polypeptides in recoverable amounts. The precise 
nature of the regulatory regions needed for gene expression may vary 
from organism to organism, as is well known in the analogous art. 

20 See, e.g., Sambrook, supra and Ausubel supra. 

The present invention accordingly encompasses the 
expression of a GPR polypeptide, in either prokaryotic or eukaryotic 
cells, although eukaryotic expression is preferred. 

Preferred hosts are bacterial or eukaryotic hosts including 

25 bacteria, yeast, insects, fungi, bird and mammalian cells either in 
vivo, or in situ, or host cells of mammalian, insect, bird or yeast 
origin. It is preferred that the mammalian celx or tissue is of 
human, primate, hamster, rabbit, rodent, cow, pig, sheep, horse, 
goat, dog or cat origin, but any other mammalian cell may be used. 

3 0 Further, by, use of, for example, the yeast ubiquitin 

hydrolase system, in vivo synthesis of ubiquitin- transmembrane 
polypeptide fusion proteins may be accomplished. The fusion proteins 
so produced may be processed in vivo or purified and processed in 

vitro, allowing synt hpgig nf a GEE — polypeptide — — &fee — ^eserrt- 

35 invention with a specified amino terminus sequence. Moreover, 
problems associated with retention of initiation codon-derived 
methionine residues in direct yeast (or bacterial) expression may be 
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avoided. Sabin et al . , Bio/Technol. 7(7): 705-709 (1989); Miller et 
al., Bio/Technol. 7(7): 698-704 (1989). 

Any of a series of yeast gene expression systems 
incorporating promoter and termination elements from the actively 
5 expressed genes coding for glycolytic enzymes produced in large 
quantities when yeast are grown in mediums rich in glucose can be 
utilized to obtain GPR polypeptides of the present invention. Known 
glycolytic genes can also provide very efficient transcriptional 
control signals. For example, the promoter and terminator signals 
10 of the phosphoglycerate kinase gene can be utilized. 

Production of GPR polypeptides or functional derivatives 
thereof in insects can be achieved, for example, by infecting the 
insect host with a baculovirus engineered to express transmembrane 
polypeptide by methods known to those of skill. See Ausubel et al, 
15 eds. Current Protocols in Molecular Biology, Wiley Interscience, 
§§16.8-16.11 (1987, 1992) . 

In a preferred embodiment, the introduced nucleotide 
sequence will be incorporated into a plasmid or viral vector capable 
of autonomous replication in the recipient host. Any of a wide 

2 0 variety of vectors may be employed for this purpose. See, e.g., 

Ausubel et al , supra . §§ 1.5, 1.10, 7.1, 7.3, 8.1, 9.6, 9.7, 13.4, 
16.2, 16.6, and 16.8-16.11. Factors of importance in selecting a 
particular plasmid or viral vector include: the ease with which 
recipient cells that contain the vector may be recognized and 
25 selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular 
host; and whether it is desirable to be able to "shuttle" the vector 
between host cells of different species. 

Preferred prokaryotic vectors known in the art include 

3 0 plasmids such as those capable of replication in E. coli (such as, 

for example, pBR322, ColEl, pSClOl, pACYC 184, ttVX) . Such plasmids 
are, for example, disclosed by Maniatis, T., et al . (Molecular 
Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1989); Ausubel et al, eds., Current 
3 5 Protocols in Molecular Biology, Wiley Interscience, New York, NY 
(1987, 1992)). Bacillus plasmids include pC194, pC221, pT127, etc. 
Such plasmids are disclosed by Gryczan, T. (In: . The Molecular 
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Biology of the Bacilli, Academic Press, NY (1982), pp. 307-329). 
Suitable Streptomyces plasmids include pIJlOl (Kendall, K.J. , et al . , 
J. Bacterid. 169:4177-4183 (1987)), and streptomyces bacteriophages 
such as 0C31 (Chater, K.F., et al . , In: Sixth International 
5 Symposium on Actinomycetales Biology, Akademiai Kaido, Budapest, 
Hungary (1986), pp. 45-54). Pseudomonas plasmids are reviewed by 
John, J.F., et al. [Rev. Infect. Dis. 8:693-704 (1986)), and Izaki, 
K. (jpn. J. Bacteriol. 21:729-742 (1978); and Ausubel et al, supra ) . 

The expressed protein may be isolated and purified in 

10 accordance with conventional conditions, such as extraction, 
precipitation, chromatography, affinity chromatography, 
electrophoresis, or the like. For example, the cells may be 
collected by centrif ugation, or with suitable buffers, lysed, and the 
protein isolated by column chromatography, for example, on 

15 DEAE- cellulose, phosphocellulose , polyribocytidylic acid-agarose, 
hydroxyapatite or by electrophoresis or immunoprecipitation. 
Alternatively, the transmembrane polypeptide or functional derivative 
thereof may be isolated by the use of anti- transmemorane polypeptide 
antibodies. Such antibodies may be obtained by well-known methods, 

20 some of which are mentioned below. These antibodies may be 
immobilized on cellulose, agarose, hollow fibers, or cellulose 
filters by covalent chemical derivatives by methods well known to 
those skilled in the art. 

As discussed herein, GPR polypeptides of the present 

25 invention may be further modified for purposes of drug design, such 
as for example to reduce immunogenicity , to prevent solubility and/or 
enhance delivery, or to prevent clearance or degradation. 

Appropriate modification of the primary amino acid sequence 
of GPR polypeptides of the present invention, obtained by mutagenesis 

30 or utilizing fragments of other related forms of G-protein 
transmembrane proteins, as described herein, will allow the creation 
of molecules which bind G-protein coupled receptors with higher 
affinity than that exhibited by naturally occurring transmembrane 

domains Small polypeptides — LhdL dm provide d according to the 

35 present invention which polypeptides maintain G-protein coupled 
receptor binding inhibition activity, are expected to have two 
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advantages over larger polypeptides. These advantages include (l) 
greater stability and dif fusibility , and (2) less immunogenicity . 

Since polypeptides according to the present invention are 
generally small (10-40, 20-30, 15-25, 30-45 amino acids), cell or 
5 tissue sources of G-protein coupled receptors are not required to 
practice the present invention, since known polypeptide syntheses 
steps can be used without undue experimentation to provide GPR 
polypeptides or sequences substantially corresponding thereto 
Pharmaceutical Preparations 

10 Preparations of GPR polypeptides for parenteral 

administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions, which may contain auxiliary agents or 
excipients which are known in the art. Pharmaceutical compositions 
such as tablets and capsules can also be prepared according to 

15 routine methods. 

By the term "protection" from infection or disease as used 
herein is intended "prevention," "suppression" or "treatment." 
"Prevention" involves administration of a GPR polypeptide, 
polypeptide derivative, or anti- idiotypic antibody prior to the 

20 induction of the disease. 

"Suppression" involves administration of the composition 
prior to the clinical appearance of the disease. 

"Treatment" involves administration of the protective 
composition after the appearance of the disease. It will be 

25 understood that in human and veterinary medicine, it is not always 
possible to distinguish between "preventing" and "suppressing" since 
the ultimate inductive event or events may be unknown, latent, or the 
patient is not ascertained until well after the occurrence of the 
event or events. Therefore, it is common to use the term 

3 0 "prophylaxis" as distinct from "treatment" to encompass both 
"preventing" and "suppressing" as defined herein. The term 
"protection," as used herein, is meant to include "prophylaxis." 

At least one GPR polypeptide, antibody or anti - idiotypic 
antibody of the present invention may be administered by any means 

35 that achieve their intended purpose, for example, to treat GPR 
related pathologies, such as psychotic disorders, including 
schizophrenia, by inhibition of binding of Dopamine D : receptors 
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using a GPR polypeptide corresponding to a fragment or consensus 
portion of a dopamine D 2 transmembrane domain; in the form of a 
pharmaceutical composition. 

For example, administration of such a composition may be by 
5 various parenteral routes such as subcutaneous, intravenous, 
intradermal, intramuscular, intraperitoneal, intranasal, transdermal, 
or buccal routes . Alternatively, or concurrently, administration may 
be by the oral route. Parenteral administration can be by bolus 
injection or by gradual perfusion over time. 
10 A preferred mode of using a GPR pharmaceutical composition 

of the present invention is by intravenous or parenteral application. 

A typical regimen for preventing, suppressing, or treating 
G-protein coupled receptor pathologies, such as dopamine receptor 
related schizophrenia, comprises administration of an effective 
15 amount of a GPR polypeptide, consensus sequence, or chemical 
derivative thereof, administered over a period of one or several 
days, up to and including between one week and about 24 months. 

It is understood that the dosage of a GPR polypeptide of 
the present invention administered in vivo or in vitro will be 
20 dependent upon the age, sex, health, and weight of the recipient, 
kind of concurrent treatment, if any, frequency of treatment, and the 
nature of the effect desired. The ranges of effective doses provided 
below are not intended to limit the inventors and represent preferred 
dose ranges. However, the most preferred dosage will be tailored to 
25 the individual subject, as is understood and determinable by one of 
skill in the art, without undue experimentation. 

The total dose required for each treatment may be 
administered by multiple doses or in a single dose. a GPR 
polypeptide or functional a chemical derivative thereof may be 
30 administered alone or in conjunction with other therapeutics directed 
to GPR related pathologies, such as a the dopamine receptor related 
pathology as a non limiting example, or directed to other symptoms 
of the disease. 

Effective duiuuiim Ot the a gpr polypeptide or composition, 

3 5 which may also include a functional derivative thereof, or a GPR 
anti- idiotypic antibody, are from about 0.01 ^g to about 100 mg/kg 
body weight, and preferably from abou~ 10 fig' to about 50 mg/kg body 
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weight, such 0.05, 0.07, 0.09, 0.1, 0.5, 0.7, 0.9, 1, 2, 5, 10, 20, 
25, 30, 40, 45, .or 50 mg/kg. 

Preparations for parenteral administration include sterile 
aqueous or non- aqueous solutions, suspensions, and emulsions, which 
5 may contain auxiliary agents or excipients which are known in the 
art. Pharmaceutical compositions such as tablets and capsules can 
also be prepared according to routine methods. 

Pharmaceutical compositions comprising at least one GPR 
polypeptide of the present invention may 

10 include all compositions wherein the GPR polypeptide is contained in 
an amount effective to achieve its intended purpose. In addition to 
the GPR polypeptide, a pharmaceutical composition may contain 
suitable pharmaceutically acceptable carriers, such as comprising 
excipients and auxiliaries which facilitate processing of the active 

15 compounds into preparations which can be used pharmaceutically. 

Pharmaceutical compositions include suitable solutions for 
administration intravenously, subcutaneous ly, dermally, orally, 
mucosally, rectally or may by injection or orally, and contain from 
about 0.01 to 99 percent, preferably from about 2C to 75 percent of 

20 active component (i.e. the antibody) together with the excipient. 
Pharmaceutical compositions for oral administration include tablets 
and capsules. Compositions which can be administered rectally 
include suppositories. 

Example 1: Synthesis of a G- Protein Transmembrane Polypeptide and 
25 Consensus Polypeptide 

The polypeptides in Figs. 1-5 were synthesized using the 
following procedure and include the following characteristics. 

Peptide I (SEQ ID NO:l) , as shown in Fig. l, was used as a 
control for hydrophobic interaction alone as the mechanism of binding 
3 0 and was run in parallel with the test polypeptides described below. 
Polypeptide II (SEQ ID NO:2), as shown in Fig. 2, represents a 
membrane -spanning fragment of transmembrane segment III in the 
dopamine D 2 receptor. This particular fragment was chosen since it 
has been implicated in the j3- adrenergic receptor as having many 
3 5 residues which are involved in iigand binding interaction. 
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Polypeptide III (SEQ ID N0:3), as shown in Fig. 3, represents the 
consensus polypeptide which was developed as a model for the dopamine 
D 2 system and polypeptide IV (SEQ ID NO:4), as shown in Fig. 4, is 
a control for length dependence to show how critical the polypeptide 
5 length is in binding studies. Polypeptide V (SEQ ID NO:5), as shown 
in Fig. 5, is a consensus sequence of transmembrane domains of 
dopamine receptors D x and D : . 

The above polypeptides I-V (SEQ ID NOS:l-5), as shown in 
Figs. 1-5, respectively, were synthesized using solid phase synthesis 

10 on a Milligen 9600 polypeptide synthesizer using Fmoc amino acids 
(provided by Milligen/Biosearch) and PAL polystyrene resin 
(Milligen/Biosearch) . Coupling times were 1 hour and the 
polypeptides were cleaved by t r i f 1 u o r o a c e t i c 
acid/phenol/H 2 0/thioanisole/ethanedithiol (82.5:5:5:5:2.5) at room 

15 temperature for 2 hours. The filtrate was collected and washed with 
2 mL of trif luoroacetic acid (TFA) and 1 mL of dichloromethane (DCM) . 
The filtrate was reduced in vacuo to 2 ml in volume and the resulting 
polypeptide was precipitated out by the addition of water. The 
polypeptides were then dissolved in 1, 1 , 1, 3 , 3 , 3 -hexafluoro- 2 -propanol 

20 [(HFIP) Eastman]; lyophilized; and stored at -20°C until 
purification. Polypeptides I-V (SEQ ID NOS:l-5) , were purified using 
reverse -phase HPLC using a preparative Vydac C4 column (Vydac) at 
60 °C at a flow rate of 6.0 mL/min with a linear gradient of 0-100% 
B in a GO min period at a UV detection wavelength of 2 75 nm. 

25 Due to the highly hydrophobic nature of tnese polypeptides, 

methanol was used with 0.1% (W/V) TFA and 0.5% (W/V) HFIP as solvent 
A and 2 -propanol with 0.1% TFA as solvent B, in order to purify these 
polypeptides. Further purification was performed with an analytical 
C4 column (Vydac) with an isocratic gradient of 40% B at a flow rate 

30 of I ml /min. Identity of the polypeptides was confirmed by Fast -atom 
bombardment mass spectrometry and electrospray mass spectrometry and 
amino acid analysis. Stock solutions of polypeptides were made in 
HFIP and stored at -20°- 80°C. 

C ircular Dichroiam (CD) — — Spectra Were rec orded on an Aviv 

3 5 model 60 DS circular dichroism spectrophotometer at room temperature 
with a l cm by 1 mm cell. The amplitude of the CD signal was 
calibrated using 1 0.1% (w/v) solution of d ( + ) -camphorsulf onic acid 
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(Aldrich) and the wavelength of the CD signal was set using standard 
absorbance peaks of benzene vapor. Polypeptide concentrations were 
determined in a Cary 210 UV spectrophotomer with the absorbance 
measured at 2 80 run. Helical content was estimated using CD signal 
5 intensity according to the method of Chen, et al Biochem . 13:3350- 
3359 (1974) . This calculation compares the experimental ellipticity 
at 222 nm ([6] 222) ([6]) to a theoretical [9] 222. The theoretical 
[9] 222 is empirically adjusted to account for differences in 
polypeptide length and is based on experimental CD data from a series 

10 of proteins with known crystal structures. Since both the curve 
shape and magnitude are important in analysis of a CD spectrum for 
secondary structure contributions, we also considered qualitatively 
the contributions to the spectral shapes from different secondary 
structures using reference curves for poly (L- lysine) . 

15 Fig. 6 shows a CD spectrum of the consensus polypeptide III 

(SEQ ID NO: 3) demonstrating that the polypeptide III is only 
partially helical in a solvent system in which most membrane 
polypeptides are strongly helical. 

Preparation of Small Unilamellar Vesicles . Polypeptides 

20 were incorporated into DMPC vesicles at lipidipeptide ratio of 147:1 
in the following manner: polypeptide in HFIP was mixed with 
dimyrystyroyl- phosphatidylcholine (synthetic) (DMPC) in dry 
chloroform and dried to a film with a stream of dry nitrogen at 0°C. 
This residue was then dried further overnight under a vacuum (1 x 10' 2 

25 torr) . The residue was then hydrated in 100 mM NaCl and sonicated 
for a 30-min period under nitrogen at 0°C. The suspension was 
sedimented for a 30-min at 100,000 g (4°C) to remove any residual 
titanium particles and large unilamellar vesicles. The supernatant 
was removed and sedimented once more at 159,000 g for a 45 min period 

30 at 4°C. The supernatant in the lower portion was used immediately. 
This basic procedure has been shown to reliably produce small 
unilamellar vesicles. 

Radioligand Binding Assays. A 0.50 mL volume of 1.00 nM 
[ 3 H] -spiperone {New England specific activity 21.4 Ci/mmol) was added 

3 5 to assay tubes which contained 0.5 mL lipid/peptide supernatant, 0.5 
mL Tris buffer pH 7.4 and 0.5 mL of cold drug for a final volume of 
2.0 mL. Nonspecific binding was defined in the presence of 1 uM of 
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(+) butaclamol or 1 uM spiperone. Appropriate controls for lipid 
vesicles containing no polypeptide were also run. Assay tubes were 
prepared in triplicate and the mixture was incubated for 1 h at 25 °C. 
Incubation was terminated by filtration through filters presoaked in 
5 0.1% polyethyleneimine (w/v, Sigma) for at- least 1 h prior to use. 

Filters were then washed with 6.0 mL of cold 50 mM Tris-HCl 
buffer, pH 7.40. For detection of radioactivity, filters were placed 
in 2.0 mL of scintillation fluid (Scintiverse) and incubated for 24 
h. The activity of the tritium was determined in a Beckman LS 7500 

10 liquid scintillation counter. Specific binding of [ 3 H] -spiperone was 
defined as the difference in binding in the presence and absence of 
unlabeled (+) butaclamol. 

Fig. 7 shows results of radioligand binding assays 
comparing polypeptide I (SEQ ID N0:1) as a control unit polypeptide 

15 III (SEQ ID NO: 3) according to the present invention. Polypeptide 
III (SEQ ID NO: 3) is shown to unexpectedly provide receptor- like 
functional binding, as demonstrated by binding to the neuroleptic 
agent, spiperone, into a stereoselective, concentration- dependent 
manner. 

20 It has also been demonstrated that as little as 0.1% of a 

GPR polypeptide according to the present invention is able to form 
a receptor- like functional binding site. Thus, a GPR polypeptide of 
the present invention is unexpectedly shown to act both as GPR 
ligands and GPR binding sites. 

25 All references cited herein, including journal articles or 

abstracts, published or corresponding U.S. ox foreign patent 
applications, issued U.S. or foreign patents, or any other 
references, are entirely incorporated by reference herein, including 
all data, tables, figures, and text presented in the cited 

30 references. Additionally, the contents of the ' references cited 
within the references cited herein are also entirely incorporated by 
reference . 

Reference to known method steps, conventional methods 

steps , — kuuwn methods Or 1 conventional metnods is not in any way an 

3 5 admission that any aspect, description or embodiment of the present 
invention is disclosed, taught or suggested in the relevant art. 
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The foregoing description of the specific embodiments will 
so fully reveal the general nature of the invention that others can, 
by applying knowledge within the skill of the art (including the 
contents of the references cited herein) , readily modify and/or adapt 
5 for various applications such specific embodiments, without undue 
experimentation, without departing from the generic concept of the 
present invention. Therefore, such adaptations and modifications are 
intended to be comprehended within the ■ meaning and range of 
equivalents of the disclosed embodiments, based on the teaching and 
10 guidance presented herein. It is to be understood that the 
phraseology or terminology herein is for the purpcse of description 
and not of limitation, such that the terminology or phraseology of 
the present specification is to be interpreted by the skilled artisan 
in light of the teachings and guidance presented herein. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Murphy, Randall B. 

Schuster, David I. 

5 (ii) TITLE OF INVENTION: POLYPEPTIDES OF G- COUPLED RECEPTOR PROTEINS, AND 

COMPOSITIONS AND METHODS THEREOF 
(iii) NUMBER OF SEQUENCES : 95 
<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BROWDY AND NEIMARK 
10 (B) STREET: 419 Seventh Street, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 
<E) COUNTRY: USA 
(F) ZIP: 20004 

15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
20 <vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/943,236 

(B) FILING DATE: 10-SEP-1992 

(C) CLASSIFICATION: 
(viii) ATTORNEY /AGENT INFORMATION: 

25 (A) NAME: Townsend, Kevin G. 

(B) REGISTRATION NUMBER: 34,033 

(C) REFERENCE /DOCKET NUMBER: MURPHY =2 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-628-5197 
30 <B) TELEFAX: 202-737-3528 

(C) TELEX: 248633 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

4 0 Leu Ser Leu Leu Leu Ser Leu Leu Ser Leu Leu Leu Ser Leu Leu Ser 

15 10 15 

Leu Leu Leu Ser Leu Tyr Tyr Tyr 
20 

(2) INFORMATION FOR SEQ ID NO : 2 : 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 0 (ix) MOLECULE TYPF : peptide 

(Xi) SaUUKNCE UKSCaiPTl-ON: SS 0 It) NO : 2 : 

Asp Asp lie Phe Val Thr Leu Asd Val Leu Phe Ser Thr Ala Ser lie 
1 5 " 10 15 



55 



Leu Asn Leu Ser Ala lie Ser Leu Lvs Lys Lvs 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Tyr Ala lie Phe Val Leu Tyr Ala Ser Ala Trp Leu Ser Phe Asn 
10 1 5 10 15 

Cys Pro Phe lie Val Thr Leu Asn lie Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Lys Ala Val Val Tyr Ser Ser He Val Ser Phe Tyr Val Phe He Asp 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Asp Cys Asp Val Phe Val Phe Val Asp He Met Leu Cys Thr Ala Ser 
15 10 15 

He Phe Asn Leu Cys Ala He Ser Val Gly Lys 
20 25 

3 5 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE .TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Leu Val Leu Leu Leu Phe Ala Asp Phe Ser Ser Met Leu Gly Cys 
1 5 .10 15 

4 5 Met Ala Val Leu He Gly Phe Trp Arg Leu Lys Leu Leu Arg Asn His 

20 25 30 

Val Thr Lys Val He Ala Cys Phe Cys Ala Thr Ser Phe Cys Lys Asp 
35 40 45 

Phe Pro Ser Thr He Leu Thr Leu Thr Asn Thr Ala Val Asn Gly Gly 
50 50 55 60 



Phe Pro 

65 



Cys Tyr Leu Tyr Ala He Val He Thr Tyr Gly Ser Phe Ala 

7 0 75 80 
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Cys Trp Leu Trp Thr Leu lie Cys Leu Ala lie Ser lie Tyr Met Leu 
85 90 95 

lie Val Lys Arg Glu Pro Glu Pro Glu Leu Phe Glu Lys Tyr Tyr Tyr 

100 105 11.; 

5 Leu Leu Cys Trp Gly Leu Pro Leu lie Ser Thr lie Gly Leu Lys Asn 

115 120 125 

Thr Val Gin Phe Val Gly Asn Trp Cys Trp He Gly Val Ser Phe Thr 
130 135 140 

Gly Tyr Arg Phe Gly Leu Phe Tyr Pro Phe Leu Phe He Trp Ala He 
10 145 150 155 160 

Ser Ala Val Leu Val Gly Leu Thr Ser Arg Tyr Thr Tyr Trp He His 
165 170 175 

Asn Gly Val Ser Asp Asn Lys Glu Lys His Leu Thr Tyr Gin Phe Lys 
180 185 190 

15 Leu He Asn Tyr He He Val Phe Leu Val Cys Trp Val Phe Ala Val 

195 200 205 

Val Asn Arg He Val Asn Gly Leu Asn Trp Pro Pro Ala Leu Asn He 
210 215 220 

Leu His Thr Tyr Leu Ser Val Ser His Gly Phe Trp Ala Ser Val Thr 
20 225 230 235 240 

Phe He Tyr Asn Asn Pro Leu Met Trp Arg Tyr Phe Gly Ala Lys lie 
245 250 255 

Leu Thr Val Phe Thr Phe Phe Gly Tyr Phe Thr Asp Val Gin Lys Lys 
260 265 270 

25 Leu Glu Lys Asn Leu Ser Pro Tyr Ser Ser Ser Arg Gly Thr Ser Gly 

275 280 285 

Lys Thr Met Leu Gly His Pro Thr Gly Asp Asp Val Gin Cys Ser Ser 
290 295 300 

Asp Leu Gin Cys Ser Leu Glu Arg His Pro Asn Met Val 
30 305 310 315 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:7; 

Val Tyr He Thr Val Glu Leu Ala He Ala Val Leu Ala Thr Leu Gly 

40 l 5 10 15 

Asn Val Leu Val Cys Trp Ala Val Trp Leu Asn Ser Asn Leu Asn Val 
20 25 30 



Thr Asn Tyr Phe Val Val Ser Leu Ala Ala Ala Asp He Ala Val Gly 

35 40 45 

4 5 Val He Ala He Pro Phe Ala He Thr He Ser Thr Gly Phe Cys Ala 

50 55 60 

Ala Cys His Asn Cys Leu Phe Phe Ala Cys Phe Val Leu Val Leu Thr 
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65 70 75 80 

Gin Ser Ser He Phe Ser Leu Leu Ala He Ala He Asp Arg Tyr He 
85 90 95 

Ala He Arg He Pro Leu Arg Tyr Asn Gly Leu Val Thr Gly Thr Arg 
5 100 105 110 

Ala Lys Gly He He Ala Val Cys Trp Val Leu Ser Phe Ala He Gly 
115 120 125 

Leu Thr Pro Met Leu Gly Trp Asn Asn Cys Ser Gin Pro Lys Glu Gly 
130 135 140 

10 Arg Asn Tyr Ser Gin Gly Cys Gly Glu Gly Gin Val Ala Cys Leu Phe 

145 150 155 160 

Glu Asp Val Val Pro Met Asn Tyr Met Val Tyr Tyr Asn Phe Phe Ala 
165 170 175 

Phe Val Leu Val Pro Leu Leu Leu Val Tyr Leu Arg He Phe Leu Ala 
15 180 185 190 

Ala Arg Arg Gin Leu Lys Gin Met Glu Ser Gin Pro Leu Pro Gly Glu 
195 200 205 

Arg Ala Arg Ser Thr Leu Gin Lys Glu Val His Ala Ala Lys Ser Ala 
210 215 220 

20 He He Val Gly Leu Phe Ala Leu Cys Trp Leu Pro Leu His He He 

225 230 235 240 

Asn Cys Phe Thr Phe Phe Cys Pro Glu Cys Ser His Ala Pro Leu Trp 
245 250 255 

Leu Met Tyr Leu Thr He Val Leu Ser His Thr Asn Ser Trp Asn Pro 
25 260 265 270 

Phe He Tyr Ala Tyr Arg He Arg Glu Phe Arg Gin Thr Phe Arg Lys 
275 280 285 

lie He Arg Ser His Val Leu Arg Arg Arg Glu Pro Phe Lys Ala Gly 
290 295 300 

3 0 Gly Thr Ser Ala Arg Ala Leu Ala Ala His Gly Ser Asp Gly Glu Gin* 

305 310 315 320 

He Ser Leu Arg Leu Asn Gly His Pro Pro Gly Val Trp Ala Asn Gly 
325 330 335 

Ser Ala Pro His Pro Glu Arg Arg Pro Asn Gly Tyr Thr 
35 340 345 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ala Tyr He Gly He Glu Val Leu He Ala Leu Val Ser Val Pro Gly 
45 l 5 10 15 

Trp Leu Val He Trp Ala Val Lys Val Asn Gin Ala Leu Arg Asp Ala 
20 25 30 
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10 



15 



20 



25 



30 



Thr Phe Cys Phe He Val Ser lie Ala Val Ala Asp Val Ala Val Gly 
35 40 " 45 

Ala Leu Val He Pro Leu Ala He Leu He Asn He Gly Pro Arg Thr 
50 55 60 

Tyr Phe His Thr Cys Leu Met Val Ala Cys Pro Val Leu He Leu Thr 
65 70 75 80 

Gin Ser Ser He He Ala Leu Leu Ala He Ala Val Asp Arg Tyr Leu 
85 90 95 

Arg Val Lys He Pro Leu Arg Tyr Lys Thr Val Val Thr Pro Arg Arg 
100 105 110 

Ala Ala Val Ala He Ala Gly Cys Trp He Leu Ser Phe Val Val Gly 
115 120 125 

Leu Thr Pro Leu Phe Gly Trp Asn Arg Leu Gly Glu Ala Gin Arg Ala 
130 135 140 

Trp Ala Ala Asn Gly Ser Gly Gly Glu Pro Val He Lys Cys Glu Phe 
145 150 155 160 

Glu Lys Val He Ser Met Glu Tyr Met Val Tyr Phe Asn Phe Phe Val 
165 170 175 

Trp Val Leu Pro Pro Leu Leu Leu Met Val Leu He Tyr Leu Glu Val 
180 185 190 

Phe Tyr Leu He Arg Arg Gin Leu Gly Lys Lys Val Ser Ala Ser Ser 
195 200 205 

Gly Asp Pro Gin Lys Tyr Tyr Gly Lys Glu Leu Lys He Ala Lys Ser 
210 215 220 

Leu Ala Leu He Leu Phe Leu Phe Ala Leu Ser Trp Leu Pro Leu His 
225 230 235 240 

He. He Asn Cys He Thr Leu Phe Cys Pro Ser Cys Arg Lys Pro Ser 
245 250 255 

He Leu Met Tyr He Ala He Phe Leu Thr His Gly Asn Ser Ala Met 
260 265 270 

Pro He Val Tyr Ala Phe Arg He Gin Lys Phe Arg Val Thr Phe Leu 
275 280 285 

Lys He Trp Asn Asp His Phe Arg Cys Gin Pro Thr Pro Pro Val Asp 

290 295 300 

Glu Asp Pro Pro Glu Glu Ala Pro His Asp 



INFORMATION FOR SEQ ID NO : 9 : 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH :" 34 2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLO GY: linear 

Hi J MOLECULE TYPE: peptide 

(xii SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Val Ala Phe He Gly He Thr Thr Gly Leu Leu Ser He Ala Thr Val 
1 5 10 15 

Thr Gly Asn Leu Leu Val Leu He Ser Phe Lys Val Asn Thr Glu Leu 



305 



. 310 
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20 25 30 

Lys Thr Val Asn Asn Tyr Phe Leu Leu Ser lie Ala Cys Ala Asp Leu 
35 40 . 45 

He He Gly Thr Phe Ser Met Leu Tyr Leu Leu Met His Trp Ala Leu 
5 50 55 60 

Gly Thr Leu Ala Cys Asp Leu Trp Leu Ala Leu Asp Tyr Val Ala Ser 
65 70 75 80 

Asn Ala Ser Val Leu Asn Leu Leu Leu lie Ser Phe Asp Arg Tyr Phe 
85 90 95 

10 Ser Val Thr Arg Pro Leu Ser Tyr Arg Ala Lys Arg Thr Pro Arg Arg 

100 105 HO 

Ala Ala He Met He Gly He Ala Trp Leu Val Ser Phe Val Leu Trp 
115 120 125 

Ala Pro Ala He Leu Phe Trp Gin Tyr Leu Val Gly Glu Arg Thr Met 
15 130 135 140 

Leu Ala Gly Gin Cys Tyr He Gin Phe Leu Ser Gin Pro He lie Thr 
145 150 155 160 

Phe Gly Thr Ala Met Ala Ala Phe Tyr Met Pro Val Thr Vai Met Thr 
1S5 170 175 

2 0 Leu Tyr Trp Arg He Tyr Arg Phe Thr Glu Asn Arg Ala Arg Glu Leu 

180 185 190 

Gin Gly Ser Glu Thr Pro Gly Lys Gly Gly Gly Ser Ser Ser Ser Ser 
195 200 205 

Glu Arg Ser Gin Pro Gly Ala Glu Gly Ser Pro Glu Thr Pro Lys Gly 
25 210 215 220 

Gin Lys Pro Arg Gly Lys Glu Leu Ala Lys Arg Lys Thr Phe Ser Leu 
225 230 235 240 

Val Lys Glu Lys Lys Ala Ala Arg Thr Leu Ser Ala He Leu Leu Ala 
245 250 255 

3 0 Phe He Leu Thr Trp Thr Pro Tyr Asn lie Met Val Leu Val Ser Thr 

260 265 270 

Phe Cys Lys Asp Cys Val Pro Glu Thr Leu Trp Glu Leu Gly Tyr Trp 
275 280 285 

Leu He Cys Tyr Val Asn Ser Thr He Asn Pro Trp Tyr Ala Leu Cys 
35 290 295 300 

Asn Lys Ala Phe Arg Asp Thr Phe Arg Leu Leu Leu Leu Cys Trp Asp 
305 310 315 320 

Lys Arg Arg Trp Arg Lys He Pro Lys Arg Pro Gly Ser Val His Arg 
325 330 335 

4 0 Thr Pro Ser Arg Gin Cys 

340 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
4 5 (B) TYPE: amino acid 

. (C) STRANDEDNESS : single 
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10 



15 



20 



25 



30 



35 



(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptid 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Val Phe He Val Leu Val Ala Gly Ser Leu Ser Leu Val Thr He 
1 '5 10 15 

He Gly Asn He Leu Val Met Val Ser He Lys Val Asn Arg His Tyr 
20 25 30 

Phe Leu Phe Ser He Ala Cys Ala Asp Leu He He Gly Val Phe Ser 
35 40 45 

Met Asn Leu Tyr Thr Leu Tyr Thr Val He Gly Tyr Trp Pro Leu Gly 
50 55 60 

Pro Val Val Cys Asp Leu Tyr Val Val Ser Asn Ala Ser Val Met Asn 
65 70 75 80 

Leu Leu He He Ser Phe Asp Arg Tyr Phe Cys Val Thr Lys Pro Leu 
85 90 95 

Thr Tyr Pro Val Lys Arg Thr Thr Lys Met Ala Gly Met Met He Ala 
100 105 110 

Ala Ala Trp Val Leu Ser Phe He Leu Trp Ala Pro Ala He Leu Phe 
H5 120 125 

Trp Gin Phe He Val Gly Val Arg Thr Val Glu Asp Gly Glu Cys Tyr 
130 135 140 

He Gin Phe Phe Ser Asn Pro Ala Val Thr Phe Gly Thr Ala He Ala 
14 5 150 155 160 

Ala Phe Tyr Leu Pro Val He He Met lie Val Leu Tyr Trp His lie 
165 170 175 

Ser Arg Ala Ser I.ys Ser Arg He Lys Lys Asp Lys Lys Glu Pro Val 
180 185 190 

Ala Asn Gin Asp Pro Val Ser Pro Ser Leu Val Gin Gly Arg He Val 
195 200 205 

Lys Pro Leu Ser Ser Asp Asp Lys He Val Arg Arg Thr Lys Gin Pro 
210 215 220 

Ala Lys Lys Lys Pro Pro Pro Ser Arg Glu Lys Lys Val Thr Arg Thr 
225 230 235 240 

He Ala He Leu Leu Ala Phe He He Thr Trp Ala Pro Tyr Asn Val 
245 250 255 

Met Val Leu He Asn Thr Phe Cys Ala Pro Cys He Pro Asn Thr Val 
260 265 270 

Trp Arg He Gly Tyr Trp Leu Cys Tyr He Asn Ser Thr He Asn Pro 
275 280 285 

Ala Cys Tyr Ala Leu Cys Asn Ala Thr Phe Lys Lys Thr Phe Lys His 
290 295 300 



Leu He Met Cys His Tyr Lys Asn He Gly Ala Thr Arg 



305 



310 



315 



(2) INFORMATION FOR SEQ ID NO: 11: 
4 5 U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 55 ammo acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide . . 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp Phe He Ala Phe Leu Thr Gly He Leu Ala Leu Val Thr He He 
15 10 15 

Gly Asn He Leu Val He Val Ser Phe Lys Val Asn Lys Gin Leu Lys 
20 25 30 

10 Thr Val Asn Asn Tyr Phe Leu Leu Ser Leu Ala Cys Ala Asp Leu He 

35 40 45 

He Gly Val He Ser Met Asn Leu Phe Thr Thr Tyr He He Met Asn 
50 55 60 

Arg Trp Ala Leu Gly Asn Thr Ala Cys Asp Leu Trp He Ala He Asp 
15 65 70 75 80 

Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val He Ser Phe 
85 90 95 

Asp Arg Tyr Phe Ser lie Thr Arg Pro Leu Thr Tyr Arg Ala Lys Arg 
100 105 110 

20 Thr Thr Lys Arg Ala Gly Val Met lie Gly Leu Ala Trp Va_ lie Ser 

115 120 125 

Phe Val Leu Trp Ala Pro Ala He Leu Phe Trp Gin Tyr Phe Val Gly 
130 135 140 

Lys Arg Thr Val Pro Pro Gly Glu Cys Phe lie Gin Phe Leu Ser Glu 
25 145 150 155 160 

Pro Thr He Thr Phe Gly Thr Ala lie Ala Ala Phe Tyr Met Pro Val 
165 170 175 

Thr He Met Arg He Leu Tyr Trp Arg He Tyr Lys Glu Thr Glu Lys 
180 185 190 

30 Arg Thr Lys Glu Leu Ala Gly Leu Gin Ala Ser Gly Thr Glu Ala Glu 

195 200 205 

Thr Glu Asn Phe Val His Pro Thr Gly Ser Ser Arg Ser Cys Ser Ser 

210 215 220 . 

Tyr Glu Leu Gin Gin Gin Lys Arg Phe Ala Leu Lys Thr Arg Ser Gin 
35 225 230 235 240 

He Thr Lys Arg Lys Leu Leu Val Lys Glu Lys Lys Ala Ala Gin Thr 
245 250 255 

Leu Ser Ala lie Leu Leu Ala Phe lie lie Thr Trp Thr Pro Tyr Asn 
260 265 270 

4 0 lie Met Val Leu Val Asn Thr Phe Cys Asp Ser Cys lie Pro Lys Thr 

275 280 285 

Tyr Trp Asn Leu Gly Gly Tyr Trp Leu Cys Tyr lie Asn Ser Thr Val 
290 295 300 

Asn Pro Val Cys Tyr Ala L u Cys Asn Lys Thr Phe Arg Thr Thr Phe 
45 305 310 315 320 

Lys Thr Leu Leu Leu Cys Gin Cys Asp Lys Arg Lys Arg Arg Lys Gin 
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325 330 335 

Gin Tyr Gin Gin Arg Gin Ser Val lie Phe His Lys Arg Val Pro Glu 
340 345 350 

Gin Ala Leu 
5 355 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 333 amino acids 
(B) TYPE: amino acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



20 



25 



30 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
""^ "" , -* " " - " — ser 

10 15 



Met Val Phe He Ala Thr Val Arg Gly Ser Leu Ser Leu Val Thr Val 

15 1 5 m „r 



Val Gly Asn He Leu Val Met Leu Ser He Lys Val Asn Arg Gin Leu 

20 25 30 

Gin Thr Val Asn Asn Tyr Phe Leu Phe Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly Ala Phe Ser Met Asn Leu Tyr Thr Val Tyr He He Lys 
50 55 60 

Gly Tyr Trp Pro Lau Gly Ala Trp Cys Asp Leu Trp Leu Ala Leu Asp 
65 70 75 80 

Tyr Val Val Ser Asn Ala Ser Val Met Leu Leu He He Ser Phe Asp 
85 90 95 

Arg Tyr Phe Cys Val Thr Lys Pro Leu Thr Tyr Pro Ala Arg Arg Thr 
100 105 m 

Thr Lys Met Ala Gly He Met lie Ala Ala Ala Trp Val Leu Ser Phe 
H5 120 125 

Val Leu Trp Ala Pro Ala He Leu Phe Trp Gin Phe Val Val Gly Lvs 
130 135 140 

Arg Thr Val Pro Asp Asn Gin Cys Phe He Gin Phe Leu Ser Asn Pro 
145 150 155 160 

Ala Val Thr Phe Gly Thr Ala lie Ala Ala Phe Tyr Leu Pro Val Val 
35 165 170 175 

He Met He Val Leu Tyr lie His lie Ser Leu Ala Ser Arg Ser Arg 
ISO 185 190 

Val His Lys His Arg Pro Glu Gly Pro Lys Glu Lys Lys Ala Lys Thr 
!95 200 205 

lie Ala Phe Leu Ser Pro lie Met Gin Ser Val Lys Lys Pro Pro 

210 215 220 

Pro Gly Glu Ala Lys Phe Ala Ser lie Ala Arg Asn Gin Val Arq Lvs 
225 775 * 240 

Lys Arg Gin Leu Ala Ala Arg Glu Arg Lys Val Thr Arg Thr lie Phe 
245 250 255 

Ala lie Leu Leu Ala Phe He Leu' Thr Trt> Thr Pro Tyr Asn Val Met 
260 265 270 
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Val Leu Val Asn Thr Phe Cys Gin Ser Cvs He Pro Asp Thr Val Trp 
275 280 " 285 

Ser He Gly Tyr Trp Leu He Cys Tyr Val Asn Ser Thr He Asn Pro 
290 295 300 

5 Ala Cys Tyr Ala Leu Cys Asn Ala Thr Phe Lys Lvs Thr Phe Arg His 

3 °5 310 315 " 320 

Leu Leu Leu Cys Gin Arg Tyr Asn He Gly Thr Ala Arg 
325 330 

(2) INFORMATION FOR SEQ ID NO: 13: 
10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Val He Thr He Ala Val Val Thr Ala Val Val Ser Leu Met Thr He 
1 5 10 is 

~ n Val G1 V Asn v *l Leu Val Met He Ser Phe Lys Val Asn Ser Gin Leu 

20 20 25 30 

Lys Thr Val Asn Asn Tyr Tyr Leu Leu Ser He Ala Cys Ala Asp Leu 
35 40 45 

He He Gly He Phe Ser Met Asn Leu Tyr Thr Thr Tyr He Leu He 
50 55 60 

Met Gly Arg Trp Ala Leu Gly Ser Leu Ala Cys Asp Leu Trp Leu Ala 

65 70 75 80 

lie Asp Tyr Val Ala Ser Asn Ala Ser Val Leu Asn Leu Leu Val He 
85 90 95 

Ser Phe Asp Arg Tyr Phe Ser He Thr Arg Pro Leu Thr Tyr Arg Ala 
100 105 no 

Lys Arg Thr Pro Lys Arg Ala Gly He Met He Gly He Ala Trp Leu 
115 120 125 

He Ser Phe He Leu Trp Ala Pro Ala He Leu Cys Trp Gin Tyr Leu 
130 135 140 

Val Gly Lys Arg Thr Val Pro He Asp Glu Cys Gin He Gin Phe Leu 
145 150 155 160 

Ser Glu Pro Thr He Thr Phe Gly Thr Ala He Ala Ala Phe Tyr lie 
165 170 175 

Pro Val Ser He Met Arg He Leu Tyr Cys Arg He Tyr Arg Glu Thr 
180 185 190 

Glu Lys Arg Thr Lys Asp Leu Ala Asp Leu Gin Gly Ser Asp Ser Val 
195 200 205 

Tyr Lys Ala Glu Lys Arg Lys Pro Ala His Arg Ala Leu Phe Arq Ser 
210 215 220 

Cys Leu Arg Cys Pro Arg Pro Thr Lys Gly Leu Asn Pro Asn Pro Se- 
225 23 ° 235 240 

His Gin Met Thr lys Arg Lys Arg Met Ser Leu Val Lys Glu Arg Lys 



25 



30 



35 



40 



45 
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245 250 255 

Ala Ala Gin Thr Leu Ser Ala lie Leu Leu Ala Phe lie lie Thr Trp 
260 265 27'^ 

Thr Pro Tyr Asn lie Met Val Leu Val Ser Thr Phe Cys Asp Lys Cys 
5 275 280 285 

Val Pro Val Thr Leu Trp His Leu Gly Tyr Trp Leu Cys Tyr lie Asn 
290 295 300 

Ser Thr Val Asn Pro lie Cys Tyr Ala Leu Cys Asn Arg Thr Phe Arg 
305 310 315 320 

10 Lys Thr Phe lie Met Leu Leu Cys Arg Trp Lys Lys Lys Lys Val Glu 

325 330 335 

Glu Lys Leu Tyr Trp Gin Gly Asn Ser Lys Leu Pro 
340 345 

(2) INFORMATION FOR SEQ ID NO: 14: 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Thr Ala Gly Asp Cys Leu He Met Leu He Val Leu Leu He Val Ala 
15 10 15 

Gly Asn Val Leu Val He Val Ala He Ala Lys Thr Pro Arg Leu Gin 
25 20 25 30 

Thr Leu Thr Asn Leu Phe He Met Ser He Ala Ser Ala Asp Leu Val 
35 40 45 

Met Leu Leu Leu Val Val Pro Phe Cys Ala Thr Leu Val Val Trp Gly 
50 55 60 

3 0 Arg Trp Glu Tyr Gly Ser Phe Phe Cys Glu Leu Trp Thr Ser Val Asp 

65 70 75 80 

Val Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val lie Ala Leu 
85 90 95 

Asp Arg Tyr Leu Ala He Thr Ser Pro Phe Arg Tyr Gin Se?- Leu Leu 
35 ioo 105 110 

Thr Arg Ala Arg Ala Arg Gly Leu Val Cys Thr Val Trp Ala He Ser 
115 120 * 125 

Ala Leu Val Ser Phe Leu Pro He Leu Leu Ser Asp Glu Ala Arg Arg 
130 135 140 



40 



Cys Tyr Asn Asp Pro Lys Cys Cys Asp Phe Val Thr Asn Arq Ala Tyr 
145 150 155 160 

Ala He Ala Ser Ser Val Val Ser Phe Tyr Val Pro Leu Cys He Met 
L£S T75 



Phe Val Tyr Leu Arg Val Phe Arg Glu Ala Gin Lvs Gin Val Lys Lvs 
45 180 185 - 190 



He Asp Ser Cys Glu Ara Arg Phe Leu Glv Gly Pro Ala Arg Pro Pro 
195 200 " 205 
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Ser Pro Ser Pro Ser Pro Val Pro Ala Pro Ala Pro Pro Gly Pro Pro 
210 215 220 

Arg Pro Ala Ala Ala Ala Ala Thr Ala Pro Leu Ala Asn Glv Arg Ala 
225 230 235 240 

Gly Lys Arg Arg Pro Ser Arg Leu Val Ala Leu Arg Glu Gin Lvs Ala 
245 250 255 

Leu Lys Thr Leu Gly lie He Met Gly Val Phe Thr Leu Cys Trp Leu 
260 265 270 

Pro Phe Phe His Arg Glu Leu Val Pro Asp Arg Leu Phe Val Phe Phe 
275 280 285 

Asn Trp Leu Arg Tyr Ala Asn Ser Ala Phe Asn Pro He He Tyr Cys 
290 295 300 

Arg Ser Pro Asp Phe Arg Lys Ala Phe Gin Gly Leu Leu Cys Cys Ala 
305 310 315 320 

Arg Arg Ala Ala Arg Arg Arg His Ala Thr His Gly Asp Arg Pro Arg 
325 330 335 

Ala Ser Gly Cys He Ala Arg Pro Gly Pro Pro Ser Pro Gly Ala Ala 
340 345 350 

Ser Asp Asp Asp Asp Asp Asp Val Val Gly Ala Thr Pro Pro Ala Arg 
355 360 365 

Leu Leu Glu Pro Trp Ala Gly Cys Asn 
370 375 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Val Val Gly He Val Met Ser Leu He Val Leu Ala He Val Phe Gly 
15 10 15 

Asn Val Leu Val He Thr Ala He Ala Lys Phe Glu Arg Leu Gin Thr 
20 25 30 

Val Thr Asn Tyr Phe He Thr Ser He Ala Cys Ala Asp Leu Val Met 
35 40 45 

Gly Leu Ala Val Val Pro Phe Gly Ala Ala His He Leu Met Lys Met 
50 55 60 

Trp Thr Phe Gly Asn Phe Trp Cys Glu Phe Trp Thr Ser He Asp Val 

65 7 ° 75 80 

Leu Cys Val Thr Ala Ser He Glu Thr Leu Cys Val He Ale Val Asp 
85 90 95 

Arg Tyr Phe Ala He Thr Ser Pro Phe Lys Tyr Gin Ser Leu Leu Thr 
100 105 no ■ 

Lys Asn Lys Ala Arg Val He He lie Met Val Trp He Val Ser Gly 
115 120 125 

Leu Thr Ser Phe Leu Pro He Leu Tyr Arg Ala Thr His Gin Glu Ala 
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130 135 140 

lie Asn Cys -Tyr Ala Asn Glu Thr Cys Cys Asp Phe Phe Thr Asn Gin 
!45 150 155 160 

Ala Tyr Ala Ala Ser Ser Ala Val Ser Phe Tyr Val . Pro Leu Val He 
5 165 170 175 

Met Val Phe Val Tyr Ser Arg Val Phe Gin Glu Ala Lys Arg Gin Leu 
180 185 190 

Gin Lys He Asp Lys Ser Glu Gly Arg Phe He Phe Val Gin Asn Leu 
195 200 205 

10 Ser Gin Val Glu Gin Asp Gly Arg Thr Gly His Gly Leu Arg Arg Ser 

210 215 220 

Ser Lys Phe Cys Leu Lys Glu His Lys Ala Leu Lys Thr Leu Gly He 
225 230 235 240 

He Pro Cys Thr Phe Thr Leu Cys Trp Leu Pro Phe Phe He Val Asn 
15 245 250 255 

lie Val Val He Gin Asp Asn Leu He Arg Lys Glu Val Tyr He Leu 
260 265 270 

Leu Asn Trp He Gly Tyr Val Asn Ser Gly Phe Asn Pro Leu lie Tyr 
275 280 285 

2 0 Cys Arg Ser Pro Asp Phe Arg lie Ala Phe Gin Glu Leu Leu Cys Leu 

290 295 300 

Arg Arg Ser Ser Leu Lys Ala Tyr Gly Asn Gly Tyr Ser Ser Asn Gly 
305 310 315 320 

Asn Thr Gly Glu Gin Ser Gly Tyr His Val Glu Gin Glu Lys Glu Asn 
25 325 330 335 

Lys Leu Leu Cys Glu Asp Leu Pro Gly Thr Glu Asp Phe Val Gly His 
340 345 35') 

Gin Gly Thr Val Pro Ser Asp Asn lie Asp 
355 360 

3 0 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
3 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Ala Leu Ala Gly Ala Leu Leu Ala Leu Ala Val Leu Ala Thr Val 
1 5 10 15 

40 Gly Gly Asn Leu Leu Val He Val Ala lie Ala Trp Thr Pro Arg Leu 

20 25 30 

Gin Thr Met Thr Asn Val Phe Val Thr Ser Leu Ala Al a Ala Asp Leu 
3-5 TS 

Asp Leu Leu Val Val Pro Pro Ala Ala Thr Leu Ala Leu Thr Gly His 
4 ^ 50 55 60 

Trp Pro Leu Gly Ala Thr Gly Cys Glu Leu Trp Thr Ser Val Asp Val 
65 70 75 80 
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Leu Cys Val Thr Ala Ser lie Glu Thr Leu Cys Ala lie Ala Val Asp 
85 90 95 

Arg Tyr Leu Ala Val Thr Asn Pro Leu Arg Tyr Gly Ala Leu Val Thr 
100 105 110 

5 Lys Arg Cys Ala Arg Thr Ala Trp Leu Val Trp Val Val Ser Ala Ala 

115 120 125 

Val Ser Phe Ala Pro lie Met Ser Gin Trp Trp Arg Val Gly Ala Asp 
130 135 140 

Ala Glu Ala Gin Arg Cys His Ser Asn Pro Arg Cys Cys Ala Phe Ala 
10 145 150 155 160 

Ser Asn Met Pro Tyr Ala Val Leu Leu Ser Ser Ser Val Ser Phe Tyr 
165 170 175 

Leu Pro Leu Leu Leu Phe Val Tyr Ala Arg Val Phe Trp Ala Thr Arg 
180 185 190 

15 Gin Leu Arg Leu Leu Arg Gly Glu Leu Gly Arg Phe Pro Pro Glu Glu 

195 200 205 

Ser Pro Pro Ala Pro Ser Arg Ser Leu Ala Pro Ala Pro Val Gly Thr 
210 215 220 

Gly Ala Pro Pro Glu Gly Val Pro Ala Cys Gly Arg Pro Pro Ala Arg 
20 225 230 235 240 

Leu lie Pro lie Arg Glu His Arg Ala Leu Cys Thr Leu Gly Leu lie 
245 250 255 

Met Gly Thr Phe Thr Leu Cys Trp Leu Pro Phe Phe lie Ala Asn Val 

260 265 270 

2 5 Leu Arg Ala Leu Gly Gly Pro Ser Leu Val Pro Gly Pro Ala Phe Leu 

275 280 285 

Ala Leu Asn Trp Leu lie Gly Tyr Ala Asn Ser Ala Phe Asn Pro Leu 
• 290 295 300 

lie Tyr Cys Arg Ser Pro Asp Phe Arg Ser Ala Phe Arg Arg Leu Leu 
30 305 310 315 320 

Cys Arg Cys Gly Arg Arg Leu Pro Pro Glu Pro Cys Ala Ala Ala Arg 
325 330 335 

Pro Ala Leu Phe Pro Ser Gly Val Pro Ala Ala Glu Ser Ser Pro Ala 
340 345 350 

3 5 Gin Pro Arg Leu Cys Gin Arg Leu Asp Gly 

355 360 

(2) INFORMATION FOR SEQ ID NO: 17: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

4 0 <B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
4 5 Ala lie Leu Leu Gly Val lie Leu Gly Gly Leu lie Leu Phe Gly Val 

15 10 15 

Leu Gly Asn He Leu Val He Leu Ser Val Ala Cys His Arg His Leu 
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20 25 30 

His Ser Val Thr His Tyr Tyr lie Val Asn Leu Ala Val Ala Asp Leu 
35 40 45 

Leu Leu Thr Ser Thr Val Leu Pro Phe Ser Ala He Phe Glu He Leu 
5 50 55 60 

Gly Tyr Trp Lys Phe Gly Arg Val Phe Cys Asn Val Trp Ala Ala Val 
65 70 75 80 

Asp Val Leu Cys Cys Thr Ala Ser He Met Leu Leu Cys He He Ser 
85 90 95 

10 He Asp Arg Tyr He Gly Val Ser Tyr Pro Leu Arg Tyr Pro Thr He 

100 105 110 

Val Thr Gin Lys Arg Gly Leu Met Ala Leu Leu Cys Val Trp Ala Leu 
115 120 125 

Ser Leu Val He Ser He Gly Pro Leu Phe Gly Trp Arg Gin Pro Ala 
15 130 135 140 

Pro Glu Asp Glu Thr He Cys Gin He Asn Glu Glu Pro Gly Tyr Val 
145 150 155 160 

Leu Phe Ser Ala Leu Gly Ser Phe Tyr Val Pro Leu Thr He He Leu 
165 170 175 

2 0 Val Met Tyr Cys Arg Val Tyr Val Val Ala Lys Arg Glu Ser Arg Gly 

180 185 19'^ 

Leu Lys Ser Gly Leu Lys Thr Asp Lys Ser Asp Ser Glu Gin Val Thr 
195 200 205 

Leu Arg He His Arg Lys Asn Ala Gin Val Gly Gly Ser Gly Val Thr 
25 210 215 220 

Ser Ala Lys Asn Lys Thr His Phe Ser Val Arg Leu Leu Lys Phe Ser 
225 230 235 240 

Arg Glu Lys Lys Ala Ala Lys Thr Leu Gly He Val Val Gly Cys Phe 
245 250 255 

3 0 Val Leu Cys Trp Leu Pro Phe Phe Leu Val Met Pro He Gly Ser Phe 

260 265 270 

Phe Pro Asp Phe Arg Pro Ser Glu Thr Val Phe Lys He Ala Phe Trp 
275 280 285 

Leu Gly Tyr He Asn Ser Cys He Asn Pro He He Tyr Pro Cys Ser 
35 290 295 300 

Ser Gin Glu Phe Lys Lys Ala Phe Gin Asn Val Leu Arg He Gin Cys 
305 310 315 320 

Leu Arg Arg Lys Gin Ser Ser Lys His Thr Leu Gly Tyr Thr Leu His 
325 330 335 

40 



Ala Pro Ser His Val Leu Glu Gly Gin His Lys Asp Leu Val Arg He 
340 345 ! ISO 

Pro Val Gly Ser Ala Glu Thr Phe Tyr Lys He Ser Lys Thr Asp Gly 
355 360 365 

Val Cvs Glu Trp Lys lie Phe 
45 370 375 



WO 94/05695 



PCI7US93/08528 



- 70 - 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS: single . 

(D> TOPOLOGY: linear 
(ii) MOLECULE TSfPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala lie Ser Val Gly Leu Val Leu Gly Ala Phe He Leu Phe Ala He 
10 1 5 10 15 

Val Gly Asn He Leu Val He Leu Ser Val Ala Cys Asn Arg His Leu 
20 25 30 

Arg Thr Pro Thr Asn Tyr Phe He Val Asn He Ala He Ala Asp Leu 
35 40 45 

15 Leu Leu Ser Phe Thr Val Leu Pro Phe Ser Ala Thr Leu Glu Val Leu 

50 55 60 

Gly Tyr Trp Val Leu Gly Arg He Phe Cys Asp He Trp Ala Ala Val 
65 70 75 80 

Asp Val Leu Cys Cys Thr Ala Ser He Leu Ser Leu Cys Ala He Ser 
20 85 90 95 

He Asp Arg Tyr He Gly Val Arg Tyr Ser Leu Gin Tyr Pro Thr Leu 
100 105 110 

Val Thr Arg Arg Tyr Ala He He Ala Leu Leu Ser Val Trp Val Leu 
115 120 125 

25 Ser Thr Val He Ser He Gly Pro Leu Leu Gly Trp Lys Glu Pro Ala 

130 135 140 

Pro Asn Asp Asp Lys Glu Cys Val Thr Glu Glu Pro Phe Leu Phe Cys 
145 150 155 160 

Ser Leu Gly Ser Phe Tyr He Pro He Ala Val He Leu Val Met Tyr 
30 165 170 175 

Cys Arg Val Tyr He Val Ala Lys Arg Thr Thr Lys Asn Leu Glu Ala 
180 185 190 

Gly Val Met Lys Glu Met Ser Asn Ser Lys Phe Leu Thr Leu Arg He 
195 200 205 

35 His Trp Ser Lys Asn Phe His Glu Asp. Thr Leu Ser Ser Thr Lys Ala 

210 215 220 

Lys Gly His Asn Pro Arg Ser Ser He Ala Val Lys Leu Phe Lys Phe 
225 230 235 240 

Ser Arg Glu Lys Lys Ala Ala Lys Thr Leu Gly He Val Val Gly Trp 
40 245 250 255 

He Leu Cys Trp Leu Pro Phe Phe He Ala Leu Pro Leu Gly Ser Leu 
260 265 270 

Phe Ser Thr Leu Lys Pro Pro Asp Ala Val Phe Lys Trp Phe Trp Leu 
275 280 285 

45 Gly Tyr Phe Asn Ser Cys Leu Asn Pro He He Tyr Pro Cys Ser Ser 

290 295 300 

Lys Glu Phe Lys Arg Ala Leu Leu Gly Cys Gin Cys Arg Gly Gly Arg 
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305 310 315 ' 320 

Arg Arg Arg Arg Arg Arg Arg Leu Ala Cys Ala Tyr Thr Tyr Arg Pro 
325 330 335 

Trp Thr Arg Gly Gly Ser Leu Glu Arg Ser Gin Ser Arg Lys Asp Ser 
5 340 345 350 

lie Asp Asp Ser Gly Ser Cys Met Ser Gly Gin Lys Arg Thr Leu Pro 
355 360 365 

Ser Ala 

370 

10 (2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 





Val 


Ala 


Gly 


Leu 


Ala 


Ala 


Val 


Val 


Gly 


Phe 


Leu 


He 


Val 


Phe 


Thr 


Val 


z u 


1 








5 










10 










15 




val 


Gly 


Asn 


Val 


Leu 


Val 


Val 


He 


Ala 


Val 


Leu 


Thr 


Ser Arg 


Ala 


Leu 










20 










25 










30 








Arg 


Ala 


Pro 


Gin 


Asn 


Leu 


Phe 


Leu 


Val 


Ser 


He 


Ala 


Ser 


Ala 


Asp 


He 








35 










40 










45 










Leu 


val 


Ala 


Thr 


Leu 


Val 


Met 


Pro 


Phe 


Ser 


Leu 


Ala 


Asn 


Glu 


He 


Met 


Z D 




50 










55 










60 












Tyr 


Trp 


Tyr 


Phe 


Gly 


Gin 


Val 


Trp 


Cys 


Gly 


val 


Tyr 


Leu 


Ala 


lie 


Asp 














70 










75 










80 




Val 


Leu 


Phe 


Cys 


Thr 


Ser 


Ser 


He 


Val 


His 


Leu 


Cys 


Ala 


He 


Ser 


Leu 












85 










90 










95 




30 


Asp 


Arg 


Tyr 


Trp 


Ser 


Val 


Thr 


Gin 


Ala 


Val 


Glu 


Tyr 


Asn 


Leu 


Lys 


Arg 










100 










105 










110 








Thr 


Pro 


Arg 


Arg 


Val 


Lys 


Ala 


Thr 


He 


Val 


Ala 


Val 


Trp 


Leu 


He 


Ser 








115 










120 










125 








35 


Ala 


Val 


lie 


Ser 


Phe 


Pro 


Pro 


Leu 


Val 


Ser 


Leu Tyr Arg Gin 


Pro 


Asp 




130 










135 










140 










Gly 


Ala 


Ala 


Tyr 


Pro 


Gin 


Cys 


Gly 


Leu 


Asn 


Asp 


Glu 


Thr 


Trp 


Tyr 


He 




145 










150 










155 










160 




Leu 


Ser 


Ser 


Cys 


He 


Gly 


Ser 


Phe 


Phe 


Ala 


Pro 


Cys 


Leu 


lis 


Tyr 


Leu 












165 










170 










175 




40 


Leu 


Val 


Tyr 


Ala 


Arg 


He 


Tyr 


Arg 


Val 


Ala 


Lys 


Arg Arg Thr 


Arg 


Thr 










180 










185 










190 








Leu 


Ser 


Glu 


Lys 


Arg 


Ala 


Pro 


Val 


Gly 


Pro 


Asp 


Gly 


Ala 


Ser 


Pro 


Thr 








195 










200 










205 










Thr 


Glu 


Asn 


Gly 


Leu 


Glv Ala Ala 


Ala 


Gly 


Glu 


Ala 


Arg 


Thr_ 


-Giy- 


Thr 


-*T5 




210 










215 










220 












Ala 


Arg 


Phe 


Leu 


Ser 


Arg 


Arg 


Arg 


Arg 


Ala 


Arg 


Ser 


Ser 


Val 


Cys 


Arg 




2 2" 










230 










235 








240 




Arg 


Lys 


Val 


Ala 


Gin 


Ala 


Arg 


Glu 


Lys 


Arg 


Phe 


Thr 


Phe 


Val 


Leu 


Ala 
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245 250 255 

Leu Val Phe Val Leu Cys Trp Phe Pro Phe Phe Phe lie Tyr Ser Leu 
260 265 270 

Tyr Gly lie Cys Arg Glu Ala Cys Gin Val Pro Gly Pro Leu Phe Lys 
5 275 280 285 

Phe Phe Phe Trp lie Gly Tyr Cys Asn Ser Ser Leu Asn Pro Val lie 
290 295 300 

Tyr Thr Val Phe Asn Gin Asp Phe Arg Pro Ser Phe Lys His lie Leu 
305 310 315 320 

10 Phe Arg Arg Arg Arg Arg Gly Phe Arg Gin 

325 330 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 30 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:20: 

2 0 Thr Ala Ala He Ala Ala Ala He Thr Phe Leu He Leu Phe Thr He 

1 5 10 15 

Phe Gly Asn Ala Leu Val He He Ala Val Leu Thr Ser Arg Ser Leu 
20 25 30 

Arg Ala Pro Gin Asn Leu Phe Leu Val Ser He Ala Ala Ala Asp He 

25 35 40 45 

Leu Val Ala Thr Leu He He Pro Phe Ser Leu Ala Asn Glu Leu Leu 
50 55 60 

Gly Tyr Trp Tyr Phe Arg Arg Thr Trp Cys Glu Val Tyr Leu Ala Leu 
65 70 75 80 

3 0 Asp Val Leu Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser 

85 90 95 

Leu Asp Arg Tyr Trp Ala Val Ser Arg Ala Leu Glu Tyr Asn Ser Lys 
100 105 110 

Arg Thr Pro Arg Arg He Lys Cys He He Leu Thr Val Trp Leu He 
35 us 120 125 

Ala Ala Val He Ser Leu Pro Pro Leu He Tyr Lys Gly Asp Gin Gly 
130 135 140 

Pro Gin Pro Arg Gly Arg Pro Gin Cys Lys Leu Asn Gin Glu Ala Trp 
145 150 155 160 

Tyr He Leu Ser Ser He Gly Ser Phe Phe Ala Pro Cys Leu He Leu 
165 170 175 

Leu Val Tyr Leu Arg He Tyr Leu He Ala Lys Arg Ser Asn Arg Arg 
180 185 190 

Gly Pro Arg Ala Lys Cys Gly Pro Gly Gin Gly Glu Ser Lys Gin Pro 
45 195 200 205 

Arg Pro Asp His Gly Gly Ala lie Ala Ser Ala Lys Leu Pro Ala He 



40 
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210 215 220 

Ala Ser Gly Arg Gly Val Gly Ala He Gly Gly Gin Trp Trp Arg Arg 
225 230 235 240 

Arg Ala His Va-1 Thr Arg Glu Lys Arg Phe Thr Phe Val Leu Ala Val 
5 245 250 255 

Val He Gly Val Phe Val Leu Cys Trp Phe Pro Phe Phe Phe Ser Tyr 
260 265 270 

Ser Leu Gly Ala He Cys Pro Lys His Cys Lys Val Pro His Gly Leu 
275 280 285 

10 Phe Gin Phe Phe Phe Trp He Gly Tyr Cys Asn Ser Ser Leu Asn Pro 

290 295 300 

Val He Tyr Thr He Phe Asn Gin Asp Phe Arg Met Phe Arg Arg He 
305 310 315 320 

Leu Cys Arg Pro Trp Thr Gin Thr Ala Trp 
15 325 330 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

2 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Thr Leu Thr Leu Val Cys He Ala Cys Leu Ser Leu Thr Val Phe Gly 
25 1 5 10 15 

Asn Val Leu Val He He Ala Val Phe Thr Ser Arg Ala Leu Lys Ala 

20 . 25 30 

Pro Gin Asn Leu Phe Leu Val Ser He Ala Ser Ala Asp He Leu Val 
35 40 45 

3 0 Ala Thr Leu Val He Pro Phe Ser Leu Ala Asn Glu Val Asn Gly Tyr 

50 55 60 

Trp Tyr Phe Gly Lys Trp Cys Glu He Tyr Leu Ala Leu Asp Val Ley 
65 70 75 80 

Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser Leu Asp Arg 
35 85 90 95 

Tyr Trp Ser He Thr Gin Ala He Glu Tyr Asn Leu Lys Arg Thr Pro 
100 105 110 

Arg Arg He Lys Ala He He He Thr Val Trp Val He Ser Ala Val 
115 120 125 



40 



lie Ser Phe Pro Pro Leu He Ser He Glu Lys Lys Gly Gly Gly Gly 
130 135 140 

Gly Pro Gin Pro Ala Glu Pro Arg Cys Glu He Asn Asd Gin Lys Trp 

-las i£G ±ss : nir 



Tyr Val He Ser Ser Cys He Gly Ser Phe Phe Ala Pro Cys Leu He 

45 165 170 175 

Trp Leu Val Tyr Val Arg He Tyr Gin He Ala Lys Arg Arg Thr Arg 

180 185 190 
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Val Pro Pro Ser Arg Arg Asp Pro Asp Ala Val Ala Ala Pro Pro Gly 
195 200 205 

Gly Thr Glu Arg Arg Pro Asn Gly Leu Gly Pro Glu Arg Ser Ala Gly 
210 215 220 

5 Pro Gly Gly Gly Arg Gly Arg Ser Ala Ser Gly Leu Pro Arg Arg Arg 

225 230 235 240 

Ala Gly Ala Gly Gly Gin Asn Arg Glu Lys Arg Phe Thr Phe Val He 
245 250 255 

Ala Val Val He Gly Val Phe Val Val Cys Trp Phe Pro Phe Phe Phe 
10 260 265 270 

Thr Tyr Thr Leu Thr Ala Val Leu Cys Ser Val Pro Arg Thr Leu Phe 
275 280 285 

Lys Phe Phe Phe Trp Phe Gly Tyr Cys Asn Ser Ser Leu Asn Pro Val 
290 295 300 

15 He Tyr Thr He Phe Asn His Asp Phe Arg Arg Ala Phe Lys Lys He 

305 310 315 320 

Leu Cys Arg Gly Asp Arg Lys Arg He Val 
325 330 

(2) INFORMATION FOR SEQ ID NO: 22: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Thr Leu Thr Leu Val Cys He Ala Gly Leu He Met Leu Ph- Thr Val 
1 5 10 15 

Pn e Gly Asn Val Leu Val He He Ala Val Phe Thr Ser Arg Ala Leu 
30 20 25 30 

Lys Ala Pro Gin Asn Leu Phe Leu Val Ser He Ala Ser Ala Asp He 
35 40 45 

Leu Val Ala Thr Leu Val He Pro Phe Ser Leu Ala Asn Glu Val Met 
50 55 60 

3 5 Tyr Trp Tyr Phe Gly Lys Val Trp Cys Glu He Tyr Leu Ala He Asp 

fi 5 70 75 80 

Val Leu Phe Cys Thr Ser Ser He Val His Leu Cys Ala He Ser Leu 
85 90 95 

Asp Arg Tyr Trp Ser He Thr Gin Ala He Glu Tyr Asn Leu Lys Aro 
40 100 105 no 

Thr Pro Arg Arg He Lys Ala He He Val Thr Val Trp Val He Ser 
115 120 125 

Ala Val He Ser Phe Pro Pro Leu Leu lie Ser He Glu Lys Lys Gly 
130 135 140. 

4 5 Ala Gly Gly Gly Gin Gin Pro Ala Glu Pro Ser Cys Lys He Asn Asd 

145 150 155 160 

Gin Lys Trp Tyr Val He Ser Ser Ser He Gly Ser Phe Phe Ala Pro 
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165 170 175 

Cys Leu lie Asn His Leu Val Tyr Val Arg lie Tyr Gin He Ala Lys 
180 * 185 190 

Arg Arg Thr Arg Val Pro Pro Ser Arg Arg Gly Pro Asp Ala Cys Ser 
5 195 200 205 

Ala Pro Pro Gly Gly Ala Asp Arg Arg Pro Asn Ala Val Gly Pro Glu 
210 215 220 

Arg Gly Ala Gly Thr Ala Gly Gly Gin Gly Glu Glu Arg Ala Gly Gly 
225 230 235 240 

10 Ala Lys Ala Ser Arg Trp Arg Gly Arg Gin Asn Arg Glu Lys Arg Phe 

245 250 255 

Thr Phe Val He Ala Val Val He Gly Val Phe Val Val Cys Trp Phe 
260 265 270 

Pro Phe Phe Phe Thr Tyr Thr Leu He Ala Val Gly Cys Pro Val Pro 
15 275 280 285 

Tyr Gin Leu Phe Asn Phe Phe Phe Trp Phe Gly Tyr Cys Asn Ser Ser 
290 295 300 

Leu Asn Pro Val He Tyr Thr lie Phe Asn His Asp Phe Arg Arg Ala 

305 310 315 320 

2 0 Phe Lys Lys He Leu Cys Arg Gly Asp Arg Lys Arg He Val 

325 330 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 
2 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

30 Leu Leu Thr Ala Leu Val Leu Ser Val He He Val Leu Thr He He 

15 10 15 

Gly Asn He Leu Val He Leu Ser Val Phe Thr Tyr Lys Pro Leu Arg 
20 25 30 

He Val Gin Asn Phe Phe He Val Ser He Ala Val Ala Asp Leu Thr 
35 35 40 45 

Val Ala Leu Leu Val Leu Pro Phe Trp Ala Tyr Ser He Leu Gly Arg 
50 55 60 

Trp Glu Phe Gly He His Leu Cys Lys Leu Trp Leu Thr Cys Asp Val 
65 70 75 80 

4 0 Leu Cys Cys Thr Ser Ser He Leu Asn Leu Cys Ala He Ala Leu Asp 

85 90 95 

Ara Tyr Trp Ala Tie Thr &cp Dm Tla ftrn Tyr- Al a C. T n Tyr 7^ r-g TW 

100 105 110 

Val Gly Arg Val Leu Leu Leu He Ser Gly Val Trp Leu Leu Ser Leu 
45 115 120 125 

Leu lie Ser Ser f -ro Pro Leu lie Gly Trp Asn Asp Trp Pro Asp Glu 
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130 135 140 

Phe Thr Ser Ala Thr Pro Cys Glu Leu Thr Ser Gin Arg lie Gly Tyr 
145 . 150 155 160 

Val lie Tyr Ser Ser Leu Gly Ser Phe Phe He Pro He Ala He Met 
5 165 170 175 

Arg He Val Tyr He Glu He Phe Val Ala Thr Arg Arg Arg Leu Arg 
180 185 190 

Glu Arg Ala Arg Ala Asn Lys He Asn Thr He Ala Leu Lys Ser Thr 
195 200 205 

10 Glu Leu Glu Pro Met Ala Asn Ser Ser Pro Val Ala Ala Ser Asn Ser 

210 215 220 

Gly Ser Lys Lys Lys Thr Ser Gly Val Asn Gin Phe He Glu Glu Lys 
225 230 235 240 

Gin Lys He Ser Leu Ser Lys Glu Arg Arg Ala Ala Arg Thr Leu Gly 

15 - 245 250 255 

He He Met Val the Val He Cys Trp Leu Pro Phe Phe He Met Tyr 
260 265 270 

Val He Leu Pro Phe Cys Cys Pro Thr Asn Lys Phe Lys Asn Phe He 
275 280 285 



20 



25 



Thr Trp Leu Gly Tyr He Asn Ser Gly Leu Asn Pro Val He Tyr Thr 
290 295 300 

He Phe Asn Leu Asp Tyr Arg Arg Ala Phe Lys Arg Leu Leu Gly Leu 
305 310 315 320 

Asn 



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Arg He Leu Thr Ala Cys Phe Leu Ser Leu Leu He Leu Ser Thr Leu 

35 l 5 io is 

Leu Gly Asn Thr Leu Val Cys Ala Ala Val He Arg Phe Arg His Leu 
20 25 30 

Arg Ser Lys Val Thr Asn Phe Phe Val lie Ser Leu Ala Val Ser Asp 
35 40 45 

4 0 Leu Leu Val Ala Val Leu Leu Trp Lys Ala Val Ala Glu lie Ala Glv 

50 55 60 

Phe Trp Pro Phe Gly Ser Phe Cys Asn He Trp Val Ala Phe Asp He 
65 70 75 80 

Met Cys Ser Thr Ala Ser He Leu Asn Leu Cys Val He Ser Val Asp 
45 85 90 95 

Arg Tyr Trp Ala He Ser Ser Pro Phe Arg Tyr Glu Arg Lys Lys Arg 
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100 105 110 

Pro Lys Ala Ala Phe lie Leu He Ser Val Ala Trp Thr Leu Ser Val 
115 120 125 

Leu He Ser Phe He Pro Val Gin Leu Ser Trp His Lys Ala Lys Pro 
5 130 135 140 

Thr Ser Pro Ser Asp Gly Met Ala Thr Ser Leu Ala Glu Thr He Asp 
145 150 155 160 

Asn Cys Asp Ser Ser Leu Ser Arg Thr Tyr Ala He Ser Ser Ser Val 
165 170 175 

10 He Ser Phe Tyr He Pro Val Ala He Leu Val Thr Tyr Thr Arg He 

180 185 190 

Tyr Arg He Ala Gin Lys Gin He Arg Arg He Ala Ala Leu Glu Arg 
195 200 205 

Ala Ala Val His Ala Lys Asn Cys Gin Gly Asn Lys Pro Val Glu Cys 
15 210 215 220 

Ser Gin Pro Glu Ser Ser Phe Met Ser Phe Lys Arg Glu Thr Lys Val 
225 230 235 240 

Leu Lys Thr Leu Ser Val He Thr Cys Val Phe Val Cys Cys Trp Leu 
245 250 255 

20 Pro Phe Phe He Leu Asn Cys lie Leu Pro Phe Cys Glv Se^- Gly Glu 

260 265 270 

Thr Gin Pro Phe Cys Thr Asp Ser Asn Thr Phe Asp Val Phe Val Trp 
275 280 285 

Phe Gly Trp Ala Asn Ser Ser Leu Asn Pro He He Tyr Ala Phe Asn 
25 290 295 300 

Ala Asp Phe Arg Lys Ala Phe Ser Thr Leu Leu Gly Cys Tyr Arg Leu 
305 310 315 320 

Cys Pro Ala Thr Asn Met Ala He Glu Thr Val Ser He Asn Asn Gly 
325 330 335 

3 0 Ala Ala Met Phe Ser Ser His His Glu Pro Arg Gly Ser He Ser Lys 

340 345 350 

Glu Cys Asn Leu Val Tyr Leu He Pro His Ala Val Gly Ser Ser Glu 
355 360 365 

Asp Leu Lys Lys Glu 
35 370 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Gin Trp Thr Ala Cys Leu Leu Thr Leu Leu He He Trp Thr Leu Leu 
45 1 5 10 15 



Gly Asn Val Leu Val Cys Ala Ala He Val Arg Ser Arg His Leu Leu 
20 25 30 
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Val Phe He Val Ser He Ala Val Ser Asp Leu Phe Val Ala Leu Leu 
35 40 45 

Val Asn Thr Trp Lys Ala Tyr Ala Glu Val Ala Gly Tyr Trp Pro Phe 
50 55 60 

5 Gly Ala Phe Cys Asp Val Trp Val Ala Phe Asp He Met Cvs Ser Thr 

65 70 75 " 80 

Ala Ser He Leu Asn Leu Cys Val He Ser Val Asp Arg Tyr Trp Ala 
85 90 95 

He Ser Arg Pro Phe Arg Tyr Lys Ala Leu Val Met Val Gly He Ala 
10 100 105 11C 

Trp Thr Leu Ser He Leu He Ser Phe He Pro Val Gin He Asn Trp 
115 . 120 125 

Asn Arg Asp Gin Ala Ala Ser Trp Gly Gly Leu Asp Leu Pro Asn Asn 
130 135 140 

15 He Asp Cys Asp Ser Ser Leu Asn Arg Thr Tyr Ala He Ser Ser Ser 

145 150 155 160 

Leu He Ser Phe Tyr He Pro Val Ala He Leu Val Thr Tyr Thr Arg 
165 170 175 

He Tyr Arg He Ala Gin Val Gin He Arg Arg He Ser Ser Leu Glu 
20 180 185 190 

Arg Ala Ala Glu His Ala Gin Ser Cys Arg Ser Ser Ala Ala Cys Ala 
195 200 205 



25 



Pro Asp Thr Ser Leu Arg Ala Ser He Lys Lys Glu Thr Lys Val Leu 
210 215 220 

Lys Thr Leu Ser Val He He Cys Val Phe Val Cys Cys Trp Leu Pro 
225 230 235 240 

Phe Phe He Leu Asn Cys Met Val Pro Phe Cys Ser Gly Hij Pro Glu 
245 250 255 

Gly Pro Pro Ala Gly Phe Pro Cys Val Ser Glu Thr Thr Phe Asp Val 
30 260 265 270 

Phe Val Trp Phe Gly Trp Ala Asn Ser Ser Leu Asn Pro Val He Tyr 
275 280 285 

Ala Phe Asn Ala Asp Phe Gin Lys Val Phe Ala Gin Leu Leu Cys Ser 
290 295 300 

35 His Phe Cys Ser Arg Thr Pro Val Glu Thr Val Asn He Ser Asn Glu 

305 310 315 320 

Leu He Ser Tyr Asn Gin Asp He Val Phe His Lys Glu He Ala Ala 
325 330 335 

Ala Tyr He His Met Met Pro Asn Ala Val Thr Pro Glv Asn Arg Glu 
4 ° 340 345 * 350 

Val Asp Asn Asp Glu Glu Glu Gly 
355 360 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
4 5 (A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
5 Tyr Asn Tyr Tyr Ala Thr Leu Leu Thr Leu Leu He Ala Va^ He Val 

15 10 15 

Phe Gly Asn Val Leu Val Cys Met Ala Val Ser Arg Glu Lys Ala Leu 
20 25 30 

Gin Thr Met Asn Tyr Leu He Val Ser lie Ala Val Ala Asp Leu Leu 
10 35 40 45 

Val Ala Thr Leu Val Trp Trp Trp Tyr Leu Glu Val Val Gly Glu Trp 
50 55 60 

Lys Phe Ser Arg lie His Cys Asp lie Phe Val Thr Leu Asp He Thr 
65 70 75 80 

15 Ala Ser He Leu Asn Leu Cys Ala lie Ser He Asp Arg Tyr Thr Ala 

85 90 95 

Val Ala Met Pro Met Leu Tyr Asn Thr Arg Tyr Ser Ser Lys Arg Arg 
100 105 110 

Val Thr Val Met lie Ser He Val Trp Val Leu Ser Phe Thr lie Ser 
20 us 120 125 

Cys Pro Leu Leu Phe Gly Leu Asn Asn Ala Asp Gin Asn Glu Cys lie 
130 135 140 

lie Ala Asn Pro Ala Phe Val Val Tyr Ser Ser He Val Se- Phe Tyr 
145 150 155 160 

2 5 Val Pro Phe lie Val Thr Leu Leu Val Tyr He Lys He Tyr lie Val 

165 170 175 

Leu Arg Arg Arg Arg Lys Arg Val Asn Thr Lys Arg Ser Ser Arg Ala 
180 185 190 

Phe Arg Ala His Leu Arg Ala Pro Leu Lys Gly Asn Cys Thr His Pro 
30 195 200 205 

Glu Asp Met Lys Leu Cys Thr Val He Pro Asn Gly Lys Thr Arg Thr 
210 215 220 

Ser Leu Lys Thr Met Ser Arg Arg Lys Leu Ser Gin Gin Lys Glu Lys 
225 230 235 240 

3 5 Lys Ala Thr Gin Met He Ala lie Val Leu Gly Val Phe He lie Cys 

245 250 255 

Lys Leu Pro Phe Phe lie Thr His lie Leu Asn lie His Cys Asp Cys 
260 265 270 

Asn lie Pro Pro Val Leu Tyr Ser Ala Phe Thr Trp Leu Gly Tyr Val 
40 275 280 285 

Asn Ser Ala Val Asn Pro lie lie Tyr Thr Thr Phe Asn He Glu Phe 
2a£ 2$$ 5trt) 

Arg Lys Ala Phe Leu Lys lie Leu His Cvs 
305 310 

45 (2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
5 (ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ala Tyr Tyr Ala Leu Ser Tyr Cys Ala Leu lie Leu Ala lie Val Phe 

15 10 15 

Gly Asn Gly Leu Val Cys Met Ala Val Leu Arg Glu Lys Ala Leu Gin 
10 20 25 30 

Thr Thr Thr Asn Tyr Leu Val Val Ser Leu Ala Val Ala Asp Leu Leu 
35 40 45 

Val Ala Thr Leu Val Trp Trp Val Val Tyr Leu Glu Val Thr Gly Gly 
50 55 60 

15 Val Trp Asn Phe Ser Arg lie Cys Cys Asp Val Phe Val Thr Leu Asp 

65 70 75 80 

Val Met Met Thr Ala Ser lie Leu Asn Leu Cys Ala lie Ser lie Asp 
85 90 95 

Arg Tyr Thr Ala Val His Tyr Gin His Gly Thr Gly Gin Ser Ser Cys 
20 100 105 110 

Arg Arg Val Ala He Met He Thr Ala Val Trp Val Leu Ala Phe Ala 
115 120 125 

Val Ser Cys Pro Leu Leu Phe Gly Phe Asn Thr Gly Asp Pro Thr Val 
130 135 140 

2 5 Cys Ser He Ser Asn Pro Asp Phe Val He Tyr Ser Ser Val Val Ser 

145 150 155 160 

Phe Tyr Leu Pro Phe Gly Val Thr Val Leu Val Tyr Ala Arg He Tyr 
165 170 175 

Val Val Leu Lys Gin Arg Arg Arg Lys Arg He Leu Thr Arg Gin Asn 
30 180 185 190 

Ser Gin Cys Asn Ser Val Arg Pro Gly Phe Pro Gin Gin Ser Thr Ser 
195 200 205 , 

Leu Pro Asp Pro Ala His Leu Glu Leu Lys Arg Ser Asn Gly Arg Leu 
210 215 220 

3 5 Ser Thr Ser Leu Lys Leu Pro Leu Gin Pro Arg Gly Val Pro Leu Arg 

225 230 235 240 

Glu Lys Lys Ala Thr Gin Met Val Ala He Val Leu Gly Ala Phe He 
245 250 255 

Val Cys Trp Leu Pro Phe Phe Leu Thr His Val He Asn Thr His Cys 
40 260 265 270 

Gin Thr Cys His Val Ser Pro Glu Leu Tyr Ser Ala Thr Thr Trp Leu 
275 280 285 

Gly Tyr Val Asn Ser Ala Leu Asn Pro Val lie Tyr Thr Thr Phe Asn 
290 295 300 

45 .He Glu Phe Arg Lys Ala Phe Leu Lys He Leu Ser Cys 

305 310 315 
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30 



35 
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2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu He Cys Ala Val Leu 
1 5 10 15 

Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu 
20 25 30 

Gin Thr Pro Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp Leu 
35 40 45 

Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin 
50 55 60 

Gly Ala Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Val Met Leu Cys 

65 70 75 80 

Thr Ala Ser He Phe Asn Leu Cys Ala He Ser Val Asp Ar~ Phe Val 
85 90 95 

Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly Ser Arg Arg 
100 105 110 

Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala Ala Val Ala 

115 120 125 

Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg Asp Pro Ala 
130 135 140 

Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser Ser Val Cys 
145 150 155 160 

Ser Phe Phe Leu Pro Cys Pro Leu Leu Tyr Trp Ala Thr Phe Arg Gly 
165 170 175 

Leu Gin Leu Val Ala Arg Arg Ala Lys Leu His Gly Arg Ala Pro Arg 
180 185 190 

Arg Pro Ser Gly Pro Gly Pro Pro Ser Pro Thr Pro Pro Ala Pro Arg 
195 200 205 

Leu Pro Gin Asp Pro Cys Gly Ala Leu Pro Pro Gin Thr Pro Pro Gin 
210 215 220 

Thr Arg Arg Arg Arg Arg Ala Lys He Thr Gly Arg Glu Ar^ Lys Ala 
225 230 235 240 

Met Arg Val Leu Pro Val Val Val Gly Ala Phe lie Leu Cys Trp Thr 
245 250 255 

Pro Phe Phe Val Val His He Thr Gin Ala Leu Cys Pro Ala Cys Ser 
260 265 270 

V al Pro Pro Arg Lou Val Sei Ala Val Tin Tip L5U Sfer Tyr Val Asn 

275 280 285 

Ser Ala He Asn Pro Val lie Tyr Thr Val Phe Asn Ala Glu Phe Arg 
290 295 300 

Asn Val Phe Arg Lys Ala Leu Arg Ala Cys Cys 



WO 94/05695 



PCI7US93/08528 



- 82 - 

305 310 315 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 327 amino acids 

5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
10 Lys He Ser Leu Ala Val Val Leu Ser Val He Thr Leu Ala Thr Val 

15 10 15 

Leu Ser Asn Ala Phe Val Leu Thr Arg He Leu Leu Thr Arg Lys Leu 
20 25 30 

His Thr Pro Ala Asn Tyr Leu He Gly Ser He Ala Thr Thr Asp Leu 
15 35 40 45 

Leu Val Ser He Leu Val Trp He Ser lie Ala Tyr Thr He Thr His 
50 55 60 

Thr Trp Asn Phe Gly Gin He Leu Cys Asp He Trp Leu Ser Ser Asp 
65 70 75 80 

20 He Thr Cys Cys ^hr Ala Ser He Leu His Leu Cys Val He Ala Leu 

85 90 95 

Asp Arg Tyr Trp Ala He Thr Asp Ala Leu Glu Tyr Ser Lys Arg Arg 
100 105 110 

Thr Ala Gly His Ala Ala Thr Met He Ala He Val Trp Ala He Ser 
25 115 120 125 

He Cys He Ser He Pro Pro Leu Phe Trp Arg Ala Lys Ala Gin Glu 
130 135 140 

Glu Met Ser Asp Cys Leu Val Asn Thr Ser Gin Ser Tyr Thr He Tyr 
145 150 155 160 

30 Ser Thr Cys Gly Ala Phe Tyr He Pro Ser Val Leu Leu He He Leu 

165 170 175 

Tyr Gly Arg He Tyr Arg Ala Ala Arg Asn Arg He Leu Asn Pro Pro 
180 185 190 

Ser Leu Tyr Gly Lys Arg Phe Thr Thr Ala His Leu He Thr Gly Ser 
35 195 200 205 

Ala Gly Ser Ser Leu Cys Ser Leu Asn Ser Ser Leu His Glu Gly His 
210 215 220 

Asn His Val Lys He Lys Leu Ala Asp Ser Ala Leu Glu Arg Lys Arg 
225 230 235 240 

40 He Ser Ala Ala Arg Glu Arg Lys Ala Thr Lys He Leu Gly He He 

245 250 255 

Leu Gly Ala Phe He He Cys Trp Leu Pro Phe Phe Val Val Ser Leu 
260 265 27 

Val. Leu Pro He Cys Arg Asp Ser Cys Trp He His Pro Ala Leu Phe 
45 275, 280 285 

Asp Phe Phe Thr Trp Leu Gly Tyr He Asn Ser Leu He Asn Pro lie 
290 295 300 
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He Tyr Thr Val Phe Asn Glu Glu Phe Arg Gin Ala Phe Gin Lys He 
305 310 315 320 

Val Pro Phe Arg Lys Ala Ser 
325 

5 (2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Val He Thr Ser Leu Leu Leu Gly Thr Leu He Phe Cys Ala Val Leu 
15 10 15 

15 Gly Asn Ala Cys Val Val Ala Ala He Ala Leu Glu Arg Ser Leu Gin 

20 25 30 

Asn Val Ala Asn Tyr Leu lie Gly Ser Leu Ala Val Arg Asp Leu Met 
35 40 45 

Val Ser Val Leu Val Leu Pro Met Ala Ala Leu Tyr Gin Val Leu Asn 
20 50 55 60 

Lys Trp Thr Leu Gly Gin Val Thr Cys Asp Leu Phe He Ala Leu Asp 
65 70 75 80 

Val Leu Cys Cys Thr Ser Ser He Leu His Leu Cys Ala He Ala Leu 
85 90 95 

2 5 Asp Arg Tyr Trp Ala He Thr Asp Pro He Asp Tyr Val As:i Lys Arg 

100 105 11C 

Thr Pro Arg Pro Arg Ala Leu He Ser Leu Thr Trp Leu He Gly Phe 
115 120 125 

Leu He Ser He Pro Pro Met Leu Gly Trp Arg Thr Pro Glu Asp Arg 
30 130 135 140 

Ser Asp Pro Asp Ala Cys Thr He Ser Lys Asp His Gly Tyr Thr He 
145 150 155 160 

Tyr Ser Thr He Phe Ala Phe Tyr He Pro Leu Leu Leu Met Leu Val 
165 170 175 

3 5 Leu Tyr Gly Arg He Phe Arg Ala Ala Arg Phe Arg He Arg Lys Thr 

180 185 190 

Val Lys Lys Val Glu Lys Thr Gly Ala Asp Thr Arg His Gly Ala Ser 
195 200 205 

Pro Ala Pro Gin Pro Lys Lys Ser Val Asn Gly Glu Ser Gly Ser Arg 
40 210 215 220 

Asn Ala Ser Phe Glu Arg Lys Asn Glu Arg Asn Ala Phe Ala Lys Leu 

225 230 235 240 

L*u Ala Arg rtln Arg T.yr Thr V-^^ Tyr Thr T . nu P . Ty TIm TV. Mui Tin 

245 250 255 

4 5 Phe He Leu Cys Trp Leu Pro Phe Phe He Val Ala Leu Val Leu Pro 

260 265 270 

Phe Cys Glu Ser Ser Cys His Met Pro Thr Leu lie Arg Ala He lie 
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275 280 285 

Asn Trp Leu Cys Val lie Asn Ser Leu Leu Asn Pro Val lie Tyr Ala 
290 295 300 

Tyr Phe Asn Lys Asp Phe Gin Asn Ala Phe Lvs Lys lie lie Lys Cys 
5 305 310 315 320 

Asn Phe Cys Arg Gin 
325 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 3 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gin Asn Trp Pro Ala Leu Ser He Val Val He He He Asn Thr He 
15 10 15 

Gly Gly Asn He Leu Val He Met Ala Val Ser Lys Lys Leu His Asn 
20 25 30 

20 Ala Thr Asn Tyr Phe Leu Met Ser He Ala He Ala Asp Me* Leu Val 

35 40 45 

Gly Phe Leu Val Trp Leu Ser Leu Leu Ala He Leu Tyr Asp Tyr Val 
50 55 60 

Trp Pro Leu Pro Arg Tyr Leu Cys Pro Val Trp He Ser Leu Asp Val 
25 65 70 75 80 

Leu Phe Ser Thr Ala Ser He Met His Leu Cys Ala He Ser Leu Asp 
85 90 95 



30 



Arg Tyr Val Ala He Arg Asn Pro He Glu His Ser Arg Phe Ser Arg 
100 105 110 

Thr Lys Ala He Met Lys He Ala He Val Trp Ala He Ser He Gly 
115 120 125 

Val Ser Val Pro He Pro Val He Gly Leu Arg Asp Glu Ser Lys Val 
130 135 140 

Phe Val Asn Asn Thr Thr He Cys Val Leu Asn Asp Pro Asn Phe Val 
35 145 150 155 160 

Leu He Gly Ser Phe Val Ala Phe Phe He Pro Thr Leu He Met Val 
165 170 175 

He Thr Tyr Phe Leu Thr He Tyr Val Leu Arg Arg Gin Th. Leu Met 
180 185 190 

4 0 Leu Leu Arg Gly His Thr Glu Glu Glu lie Ala Met Ser Leu Asn Phe 

195 200 205 

Leu Asn Cys Cys Cys Lys Lys Asn Gly Gly Glu Glu Glu Asn Ala Pro 
210 215 220 

Asn Asn Pro Asn Pro Asp Gin Lys Pro Arg Arg Lys Lys Lys Glu Lys 
45 225 230 235 240 

Arg Pro Arg Gly Thr Met Gin Ala He Asn Asn Glu Lys Lys Ala Ser 
245 250 255 
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Lys Val Leu Gly He Val Phe Phe Val Phe Leu He Met Trp Cys Pro 
260 265 270 

Phe Phe He Thr **sn He Leu Ser Val Leu Cys Gly Lys Ala Cys Asn 
275 280 285 

5 Gin Cys Lys Leu Leu Asn Val Phe Val Trp He Gly Tyr Val Cys Ser 

290 295 300 

Gly He Asn Pro Val He Tyr Thr Leu Phe Asn Lys He Tyr Arg Arg 
305 310 315 320 

Ala Phe Ser Lys Tyr Leu Arg Cys Asp Tyr Lys Pro Asp Lys Lys Pro 
10 325 330 335 

Pro Val Arg Gin lie Pro Arg Val Ala Ala Thr Ala Leu Ser Gly Arg 
340 345 350 

Glu Leu Asn Val Asn He Tyr Arg His Thr Asn Glu Arg Val Ala Arg 
355 360 365 

15 Lys Ala Asn Asp Pro Glu Pro Gly He Glu Asn Gin Val Glu Asn Leu 

370 375 380 

Glu 
385 

(2) INFORMATION FOR SEQ ID NO: 32: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
25 (ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Lys Asn Trp Ser Ala Leu Leu Thr Thr Val Val He He Leu Thr He 
15 10 15 

Ala Gly Asn He Leu Val He Met Ala Val Ser Leu Glu Lys Lys Leu 
30 20 25 30 

Gin Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala He Ala Asp Met 
35 40 45 

Leu Leu Gly Phe Leu Val Trp Val Ser Asn Glu Thr He Leu Tyr Gly 
50 55 60 

3 5 Tyr Arg Trp Pro Leu Pro Ser Lys Leu Cys Ala He Trp He Tyr Leu 

65 70 75 80 

Asp Val Leu Phe Ser Thr Ala Ser He Met His Leu Cys Ala He Ser 
85 90 95 

Leu Asp Arg Tyr Val Ala He Gin Asn Pro He His His Ser Arg Phe 

40 100 105 110 

Asn Ser Arg Thr Lys Ala Phe Leu Lys He lie Ala Val Trp Thr lie 
115 120 125 

Ser Val Gly lie Ser Met Pro lie Pro Val Phe Gly Leu Gin Asp Asp 
130 135 140 



45 



Ser Lys Val Phe Lys Glu Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe 
145 150 155 " 160 
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Val Leu He Gly Ser Phe Val Ala Phe Phe He Pro Leu Thr He Met 
165 170 175 

Val He Thr Tyr Phe Leu Thr lie Lys Ser Leu Arg Gin Lys Phe Ala 
180 185 190 

5 Thr Leu Cys Val Ser Asp Leu Ser Thr Arg Ala Lys Leu Ala Ser Phe 

195 200 205 

Ser Phe Leu Pro Gin Ser Ser Leu Ser Ser Glu Lys Leu Phe Gin Arg 
210 215 220 

Ser He His Arg Glu Pro Gly Ser Tyr Ala Gly Arg Lys Thr Met Gin 
10 225 230 235 240 

Ser He Ser Asn Glu Gin Lys Ala Cys Lys Val Leu Gly He Val Phe 
245 250 255 

Phe Leu Phe Val Val Met Trp Cys Pro Phe Phe He Thr Asn He Met 
260 265 270 

15 Val He Cys Lys Glu Ser Cys Asn Glu Asn Val He Gly Ala Leu Leu 

275 280 285 

Asn Val Phe Val Trp lie Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu 
290 295 300 

Val Tyr Thr Leu Phe Asn Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr 
20 305 310 315 320 

Leu Gin Cys Gin Tyr Lys Glu Asn Arg Lys Pro Leu Leu He Leu Val 
325 330 335 

Asn Thr He Pro Ala Leu Ala Tyr Lys Ser Ser Gin Leu Gin Val Gly 
340 345 350 

2 5 Gin Lys Lys Asn Ser Gin Glu Asp Ala Glu Gin Thr Val As^j Asp Cys 

355 360 365 

Ser Met Val Thr Leu Gly Lys Gin Gin Ser Glu 
370 375 

(2) INFORMATION FOR SEQ ID NO: 33: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

3 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

He Thr He Thr Val Val Leu Ala Val Leu He Leu He Thr Val Ala 
15 10 15 

Gly Asn Val Val Val Cys He Ala Val Gly He Asn Arg Arg Leu Arg 
40 20 25 30 

Asn Leu Thr Asn Cys Phe He Val Ser Leu Ala He Thr Asp Leu Leu 
35 40 45 

Leu Gly Leu Leu Val Leu Pro Phe Ser Ala He Tyr Gin Leu Ser Cys 
50 55 60 

45 Lys Trp Ser Phe Gly Lys Val Phe Cys Asn He Tyr Thr Ser Leu Asp 

65 70 75 80 

Val Met Leu Cys Thr Ala Ser He Leu Asn Leu Leu He Ser Leu Asp 
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85 



90 



95 



Arg Tyr Cys Ala Val Met Asp Pro Leu Arg Tyr Pro Val Leu Val Arg 
100 105 110 

Pro Val Arg Val Ala He Ser Leu Val Leu He Trp Val He Ser He 

115 120 125 

Thr Leu Ser Phe Leu Ser He His Leu Gly Trp Asa Ser Arg Asn Glu 

130 135 140 



10 



30 



Thr Ser Lys Gly Asn His Thr Thr Ser Lys Cys Lys Val Gin Val Asn 

145 150 155 160 

Glu Val Tyr Gly Leu Val Asp Gly Leu Val Thr Phe Tyr Leu Pro Leu 

165 170 175 



Leu lie Met Cys He Thr Tyr Tyr Arg He Phe Lys Val Ala Arg Asp 
180 185 l&o 

Ala Lys Arg Asn 51 is lie Ser Ser Trp Lys Ala Ala Thr He Arg Glu 
15 195 200 205 

His Lys Ala Thr Val Thr lie Ala Ala Val Met Ala Phe lie lie Cys 
210 215 220 

Trp Phe Pro Tyr Phe Thr Ala Phe Val Tyr Arg Gly Leu Arg Glv Asp 
225 230 235 * 240 

20 Asp Ala He Asn Glu Val Leu Glu Ala lie Val Leu Trp Leu Gly Tyr 

245 250 255 

Ala Asn Ser Ala Leu Asn Pro lie Leu Tyr Ala Ala Leu Asn Arg Asp 
260 265 270 

Phe Arg Thr Gly Tyr Gin Gin Leu Phe Cys Cys Arg He Ala Asn Arg 

25 275 280 285 

Asn Ser His Lys Thr Ser Leu Arg Ser Asn Ala Ser Gin Leu Ser Arg 
290 295 300 

Thr Gin Ser Arg Glu Pro Arg Gin Gin Glu Glu Lys Pro Leu Lys Leu 
305 310 315 320 

Gin Val Trp Ser Gly Thr Glu Val Thr Ala Pro Gin Gly Ala Thr Asp 
325 . 330 335 



Arg 



(2) INFORMATION FOR SEQ ID NO:34: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

lie li e Thr Tvr t.ph Pho zn a w a i nhn imi T LL . y 

1 5 10 15 

Gly Asn Gly Leu Val lie Trp Val Ala Gly Phe Arg Met Thr His Thr 

45 20 25 30 

Val Thr Thr lie Ser Tyr Leu Asn Leu Ala Val Ala Asp Phe Cys Phe 



35 



40 
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35 



40 



45 



10 

15 

20 
25 

30 
35 

(2) 

40 
45 



Thr Ser Thr Leu Pro Phe Phe Met Val Arg Leu Gly His Trp Pro Phe 
50 55 60 

Gly Trp Phe Leu Cys Lys Phe Leu Phe Thr lie Val Asp lie Asn Leu 
65 70 75 80 

Phe Gly Ser Val Phe Leu lie Ala Leu lie Ala Leu Asp Arg Cys Val 
85 90 95 

Cys Val Leu His Pro Val Trp Thr Gin Asn His Arg Thr Val Ser Leu 
100 105 110 

Ala Lys Lys Val lie He Gly Pro Trp Val Met Ala Leu Leu Leu Thr 
115 120 125 

Leu Pro Val He He Arg Val Thr He Val Pro Gly Lys Thr Gly Thr 
130 135 140 

Val Ala Cys Thr Phe Asn Phe Ser Pro Trp Thr Asn Asp Pro Lys Glu 
145 150 155 160 

Arg He Asn Val Ala Val Ala Met Leu Thr Val Arg Gly lie He Arg 
165 170 175 

Phe He He Gly Phe Ser Ala Pro Met Ser He Val Ala Val Ser Tyr 
180 185 190 

Gly Leu He Ala Thr Lys He He Lys Ser Ser Arg Pro Leu Arg Val 
195 200 205 

Leu Ser Phe Val Ala Ala Ala Phe Phe Leu Cys Trp Ser Pro Tyr Gin 
210 215 220 

Val Val Ala Leu He Ala Thr Val Arg He Arg Glu Leu Leu Gin Gly 
225 230 235 240 

Met Tyr Lys Glu He Gly He Ala Val Asp Val Thr Ser Ala He Ala 
245 250 255 

Phe Phe Asn Ser Cys Leu Asn Pro Leu Tyr Val Phe Met Gly Gin Asp 
260 265 270 

Phe Arg Glu Arg Leu He His Ala Leu Pro Ala Ser Leu Glu Arg Ala 
275 280 285 

Leu Thr Glu Asp Ser Thr Gin Thr Ser Asp Thr Ala Thr Asn Ser Thr 
290 295 300 

Leu Pro Ser Ala Glu Val Ala Leu Gin Ala Lys 
305 310 315 



INFORMATION FOR SEQ ID NO: 35: 
(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Asp He Leu Ala Leu Val He Phe Ala Val Val Phe Leu Vai Gly Val 

1 5 10 15 

Leu Gly Asn Ala Leu Val Val Trp Val Thr Ala Phe Glu Ala Lys Arg 
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20 



25 



30 



10 



.15 



20 



25 



30 



Thr lie Asn Ala He Trp Phe Leu Asn He Ala Val Ala Asp Phe Leu 
35 40 45 

Ser Cys Leu Ala Leu Pro He Leu Phe Thr Ser He Val Gin His His 
50 55 60 

His Trp Pro Phe Gly Gly Ala Ala Cys Ser lie Leu Pro Ser Leu lie 
65 70 75 80 

Leu Leu Asn Met Tyr Ala Ser He Leu Leu Leu Ala Thr lie Ser Ala 
85 90 95 

Asp Arg Phe Leu Leu Val Phe Lys Pro lie Trp Cys Gin Asn Phe Arg 
100 105 110 

Gly Ala Gly Leu Ala Trp lie Ala Cys Ala Val Ala Trp Gly He Ala 
115 120 125 

Leu Leu Leu Thr He Pro Ser Phe Leu Tyr Arg Val Val Ara Glu Glu 
130 135 140 

Tyr Phe Pro Pro Lys Val Leu Cys Gly Cys Asp Tyr Ser His Asp Lys 
145 150 155 160 

Arg Arg Glu Arg Ala Val Ala lie Val Arg Leu Val Leu Gly Phe Leu 
165 170 175 

Trp Pro Leu Leu Thr Leu Thr lie Cys Tyr Thr Thr Arg Ser Thr Lys 
180 185 190 

Thr Leu Lys Val Val Val Ala Val Val Ala Ser Phe Phe lie Phe Trp 
195 200 205 

Leu Pro Tyr Gin Val Thr Gly He Met Met Ser Phe Leu Glu Pro Ser 
210 215 220 

Ser Pro Thr Phe Leu Leu Leu Asn Lys Leu Asp Ser Leu Cys Val Ser 
225 230 235 240 

Phe Ala Tyr lie Asn Cys Cys lie Asn Pro lie lie Tyr Val Val Ala 
245 250 255 

Gly Gin Gly Gin Phe Gin Gly Arg Leu Arg Lys Ser Leu Pro Ser Leu 
260 265 270 

Leu Arg Asn Val Leu Thr Glu Glu Ser Val Val Arg Glu Ser Lys Ser 
275 280 285 

Phe Thr Arg Ser Thr Val Asp Thr Met Ala Gin Lvs Thr Gin Ala Val 
290 295 300 



) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Thr Leu Phe Val Pro Ser Val Tyr Thr Gly Val Phe Val Val Ser Leu 

15 10 15 



Pro Leu Asn lie Met Ala He Val Val Phe lie Leu Lys Met Lys Val 
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20 25 30 

Lys Lys Pro Ala Val His lie Ala Thr Ala Asp Val Leu Phe Val Ser 
35 40 45 

Val Leu Pro Phe Lys lie Ser Tyr Tyr Phe Ser Gly Ser Asp Trp Gin 
5 50 55 60 

Phe Gly Ser Glu Leu Cys Arg Phe Val Thr Ala Ala Phe Tyr Cys Asn 
65 70 75 80 

Met Tyr Ala Ser lie Leu Leu He Ser He Asp Arg Phe He Ala Val 
85 90 95 

10 Val Tyr Pro Met Gin Ser Leu Ser Trp Arg Thr Leu Gly Arg Ala Ser 

100 105 110 

Phe Thr Cys He Ala He Trp Ala lie Ala He Ala Gly Val Pro Leu 
115 120 125 

Val Leu Lys Glu Gin Thr He Gin Val Pro Gly Leu Asn He Thr Thr 
15 130 135 140 

He Cys His Asp Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala 
145 150 155 ISO 

Tyr Tyr Phe Ser Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu He 
165 170 175 

20 He Ser Thr Val Cys Tyr Val Ser He He Arg Cys Leu Ser Ser Ser 

180 185 190 

Ala Val Ala Asn Arg Ser Lys Lys Ser Arg Thr Asn Arg Cys Phe Asn 
195 200 205 

Ser Thr Val Ala Leu Phe Leu Ser Ala Ala Val Phe Cys He Phe He 
25 210 215 220 

He Cys Phe Gly Pro Thr Trp Leu Leu He Ala His Tyr Ser Phe Leu 
225 230 235 240 

Ser His Thr Ser Thr Thr Glu Ala Ala Tyr Phe Ala Tyr Leu Leu Cys 
245 250 255 

Val Cys Val Ser Ser He Ser Ser Cys He Asp Pro Leu He Tyr Tyr 
260 265 270 

Tyr Ala Ser Ser Glu Cys Gin Arg Tyr Val Tyr Ser He Leu Cys Cvs 
275 ' 280 285 

Lys Glu Ser Ser Asp Pro Ser Ser Tyr Asn Ser Ser Gly Gin Leu Met 
3d 290 295 300 

Ser Leu Thr Cys Ser Ser Asn Leu Asn Asn Ser He Tyr Lys Lys Leu 
305 310 315 320 

Leu Thr 



30 



4 0 (2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
4 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DeDtide 



' o o 
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{xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 37: 

Tyr lie Asn Thr Val lie Ser Cys Thr He Phe He Vai Gly Trp Glv 
1 5 10 15 

Asn Ala Thr Leu Leu Arg He He Tyr Gin Asn Lys Cys Met Arg Asn 
5 20 25 30 

Gly Pro Asn Ala Leu He Ala Ser He Ala Leu Gly Asp Leu lie Tyr 
35 40 45 

Val Val He Asp Leu Pro He Asn Val Pro Lys Leu He Ala Gly Arg 
50 55 60 

10 Trp Pro Phe Glu Gin Asn Asp Phe Gly Val Phe Cys Lys Phe Met Gly 

65 70 75 80 

Val Val Met He Phe Phe Gly Leu Ser Pro Leu Leu Leu Gly Ala Ala 
85 90 95 

Met Ala Ser Glu Arg Tyr Leu Gly lie Thr Arg Pro Phe Ser Arg Pro 
15 100 105 110 

Ala Val Ala Ser Gin Arg Arg Ala Trp Ala Thr Val Gly Leu Val Trp 
115 120 125 

Ala Ala Ala Leu Ala Leu Gly Leu Leu Pro Leu Leu Gly Val Gly Arg 
130 135 140 

2 0 Tyr Thr Val Gin Tyr Pro Gly Ser Trp Cys Phe Leu Thr Leu Gly Ala 

145 150 155 160 

Glu Ser Gly Asp Val Ala Phe Gly Leu Leu Phe Ser Gly Lev Ser Val 
165 170 175 

Gly Leu Ser Phe Leu Leu Asn Thr Val Ser Val Ala Thr Leu His His 
25 180 185 190 

Val Tyr His Gly Gin Glu Ala Ala Gin Gin Arg Pro Arg Asp Ser Glu 
195 200 205 

Val Glu Met Met Ala Gin Leu Leu Gly He Met Val Val Ala Ser Val 
210 215 220 

3 0 Cys Trp Leu Pro Leu Leu Val Phe lie Ala Gin Thr Val Leu Arg Asn 

225 230 235 240 

Pro Pro Ala Met Ser Pro Ala Gly Gin Leu Ser Arg Thr Thr Glu Lys 
245 250 255 

Glu Leu Leu He Tyr Leu Arg Val Ala Thr Trp Asn Gin lie Leu Asp 

35 260 265 270 

Pro Trp Val Tyr lie Leu Phe Arg Arg Ala Val Leu Arg Arg Leu Gin 
275 280 285 

Pro Arg Leu Ser Thr Arg Pro Arg Ser Leu Ser Leu Gin Pro Gin Leu 
290 295 300 

4 0 Thr Gin Arg Ser Gly Leu Gin 

305 310 

(2) INFORMATION FOR SEQ ID NO: 38: 
(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 
4 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Lys Tyr Phe Val Val He He Tyr Ala Leu Val Phe Leu Leu Ser Leu 
1 5 10 15 

5 Leu Gly Asn Ser Leu Val Met Leu Val He Leu Tyr Ser Arg Gly Val 

20 25 30 

Arg Ser Val Thr He Val Tyr Leu Leu Asn He Ala He Ala Asp Leu 
35 40 45 

Leu Phe Ala Leu Thr Leu Pro He Trp Ala Ala Ser Lys Val Asn Gly 
10 50 55 60 

Trp lie Phe Gly Thr Phe Leu Cys Lys Trp Ser Leu Leu Lys Glu Val 
65 70 75 80 

Asn Phe Tyr Ser Gly He Leu Leu Leu Ala Cys He Ser Val Asp Arg 
85 90 95 

15 Tyr Leu Ala He Val Arg Ala Thr Arg Thr Leu Thr Gin Lys Arg His 

100 105 llu 

Leu Val Lys Phe He Cys Leu Ser lie Trp Gly Leu Ser Leu Leu Leu 
115 120 125 

Ala Leu Pro Val Leu Leu Phe Arg Arg Thr Val Tyr Ser Ser Asn Val 
20 130 135 140 

Ser Pro Ala Cys Tyr Glu Asp Met Gly Asn Asn Tyr Ala Asn Trp Arg 
145 150 155 160 

Met Leu Leu Pro He Leu Pro Gin Ser Phe Gly Phe He Val Pro Leu 
165 170 175 

25 Leu lie Met Leu Tyr Cys Tyr Gly Phe Thr Leu Arg Thr Leu Phe Lys 

180 185 190 

Ala He Met Gly Gin Lys His Arg Ala Met Arg Val He Phe Ala Val 
195 200 205 

Val Leu He Phe Leu Leu Cys Trp Leu Pro Tyr Asn Leu Val Leu He 
30 210 215 220 

Ala Asp Thr Leu Met Arg Thr Gin Val He Gin Glu Thr Cys Glu Arg 
225 230 235 240 

Arg Asn His lie Asp Arg Ala He Asp Ala Thr Glu He Leu Gly He 
245 250 255 

3 5 Leu His Ser Cys Leu Asn Pro Leu He Tyr Ala Phe He Gly Gin Lys 

260 265 270 

Phe Arg His Gly Leu Leu Lys He Leu Ala lie His Gly Leu He Ser 
275 280 285 

Lys Asp Ser Leu Pro Lys Asp Ser Arg Pro Ser Phe Val Gly Ser Ser 
40 290 295 300 

Ser Gly His Thr Ser Thr Thr Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS : 

4 5 (A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 



o 
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10 



15 



20 



25 



30 



35 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Leu Phe Pro He Val Tyr Ser He He Phe Val Leu Gly He He Ala 

15 10 15 

Asn Gly Tyr Val Leu Trp Val Phe Ala Arg Leu Tyr Pro Ser Lys Lys 
20 25 30 

Asn Glu lie Lys He Phe Met Val Asn Leu Thr Val Ala Asp Leu Leu 
35 40 45 

Phe Leu He Thr Leu Pro Leu Trp lie Val Tyr Tyr Ser Asn Gin Gly 
50 55 60 

Asn Trp Phe Leu Pro Lys Phe Leu Cys Asn Leu Ala Gly Cys Leu Phe 
65 70 75 80 

Phe He Asn Thr Tyr Cys Ser Val Ala Phe Leu Gly Val He Thr Tyr 
85 90 95 

Asn Arg Phe Gin Ala Val Lys Tyr Pro lie Lys Thr Ala Gin Ala Thr 
100 105 110 

Thr Arg Lys Arg Gly He Ala Leu Ser Leu Val lie Trp Val Ala lie 
115 120 125 

Val Ala Ala Ala Ser Tyr Phe Leu Val Met Met Asp Ser Thr Asn Val 
130 135 140 

Val Ser Asn Lys Ala Gly Ser Gly Asn He Thr Arg Cys Phe Glu Arg 
145 150 155 160 

Tyr Glu Lys Gly Ser Lys Pro Val Leu He lie His lie Cys lie Val 
165 170 175 

Leu Gly Phe Phe He Val Phe Leu Leu lie Leu Phe Cys Asn Leu Val 
180 185 190 

lie lie His Thr Leu Leu Arg Gly Pro Val Lys Gin Gin Arg Asn Ala 
195 200 205 

Glu Val Arg Arg Arg Ala Leu Trp Met Val Cys Thr Val He Ala Val 
210 215 220 

Phe Val lie Cys Phe Val Pro His His Met Val Gin Leu Pro Trp Thr 
225 230 235 240 

Leu Ala Glu Leu Gly Met Trp Pro Ser Ser Asn His Gin Ala He Asn 
245 250 255 

Asp Ala His Gin Val Thr Leu Cys Leu Leu Ser Thr Asn Cys Val Leu 
260 265 270 

Asp Pro Val lie Tyr Cys Phe Leu Thr Lys Lys Phe Arg Lys His Leu 
275 280 285 

Ser Gl u Lys Leu Asn lie Met Arg Ser Ser Gin Lvs Cys Ser Arg val 
2SV 1 ; 295 300 

Thr Arg Asp Thr Gly Thr Glu Met Ala He Pro lie Asn His Thr Pro 

305 310 315 320 

Val Asn. Pro lie Lys Asn 



WO 94/05695 



PCI7US93/08528 



- 94 - 

325 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
10 Tyr He Asn Thr He Val Ser Cys Leu Val Phe Val Leu Gly He He 

15 10 15 

Gly Asn Ser Thr Leu Leu Arg He He Tyr Lys Asn* Lys Cys Met Arg 
20 25 30 

Asn Gly Pro Asn He Leu He Ala Ser He Ala Leu Gly Asp Leu Leu 
15 35 40 45 

His He He He Asp He Pro He Met Ala Tyr Lys Leu He Ala Gly 

50. 55 60 

Asp Trp Pro Phe Ala Cys Lys Leu Phe Pro Phe Leu Gin Lys Ser Ser 
65 70 75 80 

2 0 Val Gly He Thr Val Leu Asn Leu Cys Ala Leu Ser Val Asp Arg Tyr 

85 90 95 

Arg Ala Val Ala Ser Trp Ser Arg Val Gin Gly He Gly He Pro Leu 
100 105 110 

Val Thr Ala He Glu He Val Ser He Trp He Leu Ser Phe He Leu 
25 H5 120 125 

Ala He Pro Glu Ala He Gly Phe Trp Met Val Pro Phe Glu Tyr Lys 
130 135 140 

Gly Ala Gin His Arg Thr Cys Met Leu Asn Ala Thr Ser Lys Leu Phe 
145 150 155 160 

3 0 Tyr Gin Asp Val Lys Asp Trp Trp Leu Phe Gly Phe Tyr Phe Leu Leu 

165 170 175 

Val Cys Thr Ala He Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Asn 
180 185 190 

Arg Arg Asn Gly Ser Leu Arg He Ala Leu Ser Glu His Leu Lys Gin 
35 195 200 205 

Arg Arg Glu Val Ala Lys Thr Val Phe Cys Leu Val Val He Phe Ala 
210 215 220 

Leu Cys Trp Phe Pro Leu His Leu Ser Arg He Leu Lys Lys Thr Val 
225 230 235 240 

4 0 Tyr Asp Glu Met Asp Thr Asn Arg Cys Glu Leu Leu Ser Phe Leu Leu 

245 250 255 

Leu Met Tyr lie Gly He Asn Thr Ala Thr Met Ser Cys He Asn Pro 
260 265 27G 

He Ala Leu Tyr Phe Val Ser Lys Lys Phe Lys Asn Cys Phe Gin Ser 
45 275 280 285 

Cys Leu Cys Cys Cys Cys Tyr Gin Ser Lys Ser lie Met Thr Ser Val 
290 295 300 
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Pro Met Gin Gly Thr Ser lie Gin Trp Lys Asn His Glu Gin Asn Asn 
305 310 315 320 

His Asn Thr Glu Arg Ser Ser Hxs Lys Asp Ser He Asn 
325 330 

5 (2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

Leu He Ala Ser Pro Trp Phe Ala Ala Ser Phe Cys Val Val Gly Leu 

15 10 15 

15 Ala Ser Asn Leu Leu Ala Leu Ser Val Leu Ala Gly Ala Arg Gin Ser 

20 25 30 

Ser Ser His Thr Arg Ser Ser Phe Leu Thr Phe Leu Cys Gly Leu Val 
35 40 45 

Leu Thr Leu Asp Phe Leu Gly Leu Leu Val Thr Gly Thr II* Val Val 
20 50 55 60 

Ser Gin His Ala Ala Leu Phe Glu Trp His Ala Val Asp Pro Gly Cys 
65 70 75 80 

Arg Leu Cys Arg Leu Val Pro Phe He Gin Lys Ala Ser Val Gly He 
85 90 95 

25 Thr Val Leu Ser Leu Cys Ala Leu Ser He Asp Arg Tyr Arg Ala Val 

100 105 110 

Ala Ser Trp Ser Arg He Lys Gly He Gly Val Pro Lys Trp Thr Ala 
115 120 125 

Val Glu He Val Leu He Trp Val Val Ser Val Val Leu Ala Val Pro 
30 130 135 140 

Glu Ala He Gly Phe Asp Thr Thr Ser Asp Tyr Lys Gly Lys Pro Leu 
145 150 155 160 

Arg Val Cys Met Leu Asn Pro Phe Gin Lys Thr Ala Phe Met . Phe Tyr* 
165 170 175 

3 5 Lys Thr Ala Ala Lys Asp Trp Trp Leu Phe Ala Phe Tyr Phe Cys Leu 

180 185 190 

Pro Leu Ala He Thr Ala lie Phe Tyr Thr Leu Met Thr Cys Glu Met 
195 200 205 

Leu Arg Lys Lys Ser Gly Met Gin He Ala Leu Asn Asp His Leu Lys 
40 210 215 220 

Gin Arg Arg Glu Val Ala Lys Thr Val Phe Cys Leu Val Leu Val Phe 
225 230 235 240 



Ala Leu Cys Trp Leu Pro Leu His Leu Ser Arg He Leu Lys Leu Thr 
1 ZT5~ 250 255 



4 5 Leu Tyr Asp Gin Ser Asn Pro Gin Arg Cys Glu Leu Leu Ser Phe Leu 

260 265 270 

Leu Val Leu Asp Tyr He Gly He Asn Met Ala Ser He Asn Ser Cys 
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275 280 285 

lie Asn Pro lie Ala Leu Tyr Leu Val Ser Lys Arg Phe Lys Asn Cys 
290 295 300 

Phe Lys Ser Cys Leu Cys Cys Trp Cys Gin Thr Phe Glu Glu Lys Gin 

5 305 310 315 320 

Ser Leu Glu Glu Lys Gin Ser Cys Leu Lys Phe Lys Ala Asn Asp His 
325 330 335 

Gly Tyr Asp Asn Phe Arg Ser Ser Asn Lys Tyr Ser Ser Ser 
340 345 350 

10 (2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

He Tyr Val He Pro Ala Val Tyr Gly Leu He He Val He Gly Leu 
15 10 15 

2 0 He Gly Asn He Thr Leu He Lys He Phe Cys Thr Val Lyj Ser Leu 

20 25 30 

Asn Leu Phe He Ser Ser He Ala Leu Gly Asp Leu Leu Leu Leu Val 
35 40 45 

Thr He Cys Ala Pro Val Asp Ala Ser Lys Tyr He Ala Asp Arg Trp 
25 50 55 60 

Leu Phe Gly Arg He Gly Cys Lys Leu He Pro Phe He Gin Leu Thr 

65 70 75 80 

Ser Val Gly Val Ser Val Phe Thr Leu Thr Ala Leu Ser Ala Asp Arg 
85 90 95 

3 0 Tyr Lys Ala He Val Arg Pro Thr Cys He Gin Ala Ser Leu He Cys 

100 105 110 

Leu Lys Ala Ala Leu He Trp He Val Ser Leu Leu Ala He Pro Glu 
115 120 125 

Ala Val Phe Ser Asp Leu His Pro Phe His Val Lys Asp Thr Asn Gin 
35 130 135 140 

Thr Phe He Ser Cys Ala Pro Tyr Pro His Ser Asn Glu Leu His Pro 
145 150 155 160 

Lys He His Ser Met Ala Ser Phe Leu Val Phe Tyr Val He Pro Leu 
165 170 175 

4 0 Ala He He Ser Val Tyr Tyr Tyr Phe He Ala Arg Asn Leu He Gin 

180 185 190 

Ser Ala Tyr Asn Leu Pro Val Glu Gly Asn He His Val Lys Lys Gin 
195 200 205 

He Glu Ser Arg Lys Arg Leu Ala Lys Thr Val Leu Val Ph* Val Gly 
45 210 215 220 

Leu Phe Ala Phe Cys Trp Leu Pro Asn His Val He Tyr Leu Tyr Arg 
225 230 235 240 
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Ser Tyr His Tyr Ser Glu Val Asp Thr Ser Met Leu His Phe Val Thr 
245 250 255 

Ser He Cys Ala Arg Leu Leu Ala Pro Thr Asn Ser Cys Val Asn Pro 
260 265 270 

5 Phe Ala Leu Tyr Leu Leu Ser Lys Ser Phe Arg Gin Phe Asr. Thr Gin 

275 280 285 

Leu Leu Cys Cys Gin Pro Gly Leu Ser His Ser Thr Gly Arg Ser Leu 
290 295 300 

Ser Phe Lys Ser Thr Asn Pro Ser Ala Thr Phe Ser Leu lie Asn Arg 
10 305 310 315 320 

Asn He Cys His Glu Gly Tyr Val 
325 

(2) INFORMATION FOR SEQ ID NO:43: 
(i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Cys Val He Pro Ser Ser Leu Tyr Leu He He He Ser Val Gly Leu 
15 10 15 

Leu Gly Asn He Met Leu Val Lys He Phe Leu Thr Asn Ser Thr Met 

20 25 30 

25 Arg Ser Val Pro Asn He Phe He Ser Asn He Ala Ala Gl" Asp Leu 

35 40 45 

Leu Leu Leu Leu Tnr Cys Val Pro Val Asp Ala Ser Arg Tyr Phe Phe 
50 55 60 

Asp Glu Trp Val Phe Gly Lys Leu He Gly Cys Lys Leu He Pro Ala 
30 65 70 75 80 

He Gin Leu Thr Ser Val Gly Val Ser Val Pro Thr Leu Thr Ala Leu 
85 90 95 

Ser Ala Asp Arg Tyr Arg Ala He Val Asn Pro Met Asp Met Thr Ser 
100 105 110 

3 5 Gly Val Val Leu Trp Thr Ser Val Ala Val Gly He Trp Val Val Ser 

115 120 125 

Val Leu Leu Ala Val Pro Glu Ala Val Phe Ser Glu Val Ala Arg He 
130 135 140 

Gly Ser Ser Asp Asn Ser Ser Phe Thr Ala Cys He Pro Tyr Pro Gin 
40 145 150 155 160 

Thr Asp Glu Leu His Pro Lys He His Ser Val Leu He Phe Leu Val 
165 170 175 

Tyr Phe Leu He Pro Leu Val H p Hp Q^-r n 0 t^t- t^t- Ty • Hi n Tie 

180 185 190 



4 5 Ala Lys Thr Leu lie Arg Ser Ala His Asn Leu Pro Gly Glu Tyr Asn 

195 200 205 

Glu His Thr Lys Lys Gin Met Glu Thr Arg Lys Arg Leu Ala Lys He 
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210 215 220 

Val Leu Val Phe Val Gly Cys Phe Val Phe Cys Trp Phe Pro Asn His 
225 230 235 240 

lie Leu Tyr Leu Tyr Arg Ser Phe Asn Tyr Lys Glu lie Asp Pro Ser 
5 245 250 255 

Leu Gly Thr Cvs Val Thr Leu Val Ala Arg Val Leu Ser Phe Ser Asn 
260 265 270 

Ser Cys Val Asn Pro Phe Ala Leu Tyr Leu Leu Ser Glu Ser Phe Arg 
275 280 285 

10 Lys His Phe Ser Asn Gin Leu Cys Cys Gly Gin Lys Ser Tyr Pro Glu 

290 295 300 

Arg Ser Thr Ser Tyr Leu Leu Ser Ser Ser Ala Val Trp Arg Ser Leu 
305 310 315 320 

Lys Ser Asn Ala Lys Asn Val Val Thr Asn Ser Val Leu lie Asn Gly 
15 325 330 335 

His Ser Thr Lys Gin Glu lie Ala Leu 
340 345 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; peptide 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

Tyr Thr Leu Ser Phe lie Tyr lie Phe He Phe Val He Cys Glx Leu 
15 10 15 

Leu Ala Asn Ser Val Val Val Trp Val Asn He Gin Ala Lys Thr Thr 
20 25 30 

3 0 Gly Tyr Asp Thr His Cys Tyr He Leu Asn Leu Ala He Ala Asp Leu 

35 40 45 

Trp Trp Leu Thr He Pro Val Trp Trp Ser Leu Val Gin His Asn Gin 
50 55 60 

Trp Pro Met Gly Glu Leu Thr Cys Lys Val Thr His Leu He Phe Ser 
35 65 70 75 80 

lie Asn Leu Phe Ser Gly lie Phe Phe Leu Thr Cys Met Ser Val Asp 
85 90 95 

Arg Tyr Leu Ser lie Thr Tyr Phe Thr Asn Thr Pro Ser Sei Arg Lys 
100 105 110 

4 0 Lys Met Val Arg Arg Ala Val Cys He Leu Val Trp Leu Leu Ala Phe 

115 120 125 

Cys Val Ser Leu Pro Asp Thr Tyr Tyr Leu Lys Thr Val Thr Ser Ala 
130 135 140 

Ser Asn Asn Glu Thr Tyr Cys Aro Ser Phe Tyr Pro Glu His Ser lie 
45 145 150 155 160 

Lys Glu Trp Leu lie Ser Leu Leu Val Ser Val Val Leu He Gly Phe 
165 170 175 
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Ala Val Pro Phe Ser lie He Ala Val Phe Tyr Phe Ser Leu He Ala 
180 185 190 

Arg Ala He Ser Ala Ser Ser Asp Gin Glu Lys His Ser Ser Arg Lys 
195 200 205 

5 He He Phe Ser Tyr Val Val Val Phe Leu Val Cys Trp Leu Pro Tyr 

210 215 220 

His Val Ala Val Leu Leu Asp He Phe Ser He Leu His Tyr He Pro 
225 230 235 240 

Phe Thr Cys Arg Leu Glu His Ala Leu Phe Thr Ala Leu His Val Thr 
10 245 250 255 

Gin Cys Leu Ser Leu Val His Cys Cys Val Asn Pro Val Leu Tyr Ser 
260 265 270 

Phe He Asn Arg Asn Tyr Arg Tyr Glu He Asn Trp He Phe Lys Tyr 
275 280 285 

15 Ser Ala Lys Thr Gly Leu Thr Lys Leu lie Asp Ala Ser Arg Val Ser 

290 295 300 

Glx Thr Glu Tyr Ser Ala Leu Glu Gin Asn Ala Lys 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 45: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Lys Val Leu Val Thr Ala He Tyr Leu Ala Leu Phe Val Va. Gly Thr 
15 10 15 

Val Gly Asn Ser Val Thr Ala Phe Thr Leu Ala Arg Lys Lys Ser Leu 
30 20 25 30 

Gin Ser Leu Gin Ser Thr Val His Tyr His Leu Ser Ser Leu Ala Leu 
35 40 45 

Ser Asp Leu Leu He Leu Leu Trp Val Glu Leu Tyr Asn Phe He Trp 
50 55 60 

3 5 His His Pro Trp Ala Phe Gly Asp Ala Gly Cys Arg Gly Tyr Tyr Phe 

65 70 75 80 

Leu Arg Asp Ala Cys Thr Tyr Ala Thr Ala Leu Asn Val Ala Ser Leu 
85 90 95 

Ser Val Glu Arg Tyr Leu Ala lie Cys His Pro Phe Lys Ala Lys Thr 
40 100 105 110 

Leu Met Ser Arg Ser Arg Thr Lys Lys Phe He Ser Ala He Trp Leu 
115 120 125 



Ala Ser Ala LPIl T.P1I Ala TT a Dm Mnr Ton Hhn Thr> T . cn HI y T.uh , m .i 

130 135 . 140 



4 5 Asn Arg Ser Gly Asp Gly Thr His Pro Gly Glv Leu Val Cys Thr Pro 

145 150 155 160 

lie Val Asp Thr Ala Thr Val Lys Val Val He Gin Val Asn Thr Phe 
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165 170 175 

Met Ser Phe Leu Phe Pro Met Leu Val lie Ser lie Leu Asn Thr Val 
180 185 190 

lie Ala Asn Lys Leu Thr Val Met Val His Gin Ala Ala Glu Gin Gly 
5 195 200 205 

Arg Val Cys Thr Val Gly Thr His Asn Gly Leu Glu His Ser Thr Phe 
210 215 220 

Asn Met Arg lie Glu Pro Gly Arg Val Gin Ala Leu Arg His Gly Val 
225 230 235 240 

10 Leu Val Leu Arg Ala Val Val He Ala Phe Val Val Cys Trp Leu Pro 

245 250 255 

Tyr Leu Cys Tyr He Ser Asp Glu Gin Trp Arg Thr Phe Leu Phe Asp 
260 265 270 

Phe Tyr His Tyr Phe Tyr Met Leu Thr Asn Ala Leu Phe Tyr Val Ser 
15 275 280 285 

Ser Ala He Asn Pro He Leu Tyr Asn Leu Val Ser Ala Asn Phe Arg 
290 295 300 

Gin Val Phe Leu Ser Thr Leu Ala Cys Leu Phe Cys Pro Gly Trp Pro 
305 310 315 320 

20 Leu He Arg Arg Lys Lys Arg Pro Thr Phe Ser Arg Lys Pro Asn Ser 

325 330 335 

Met Ser Ser Asn His Ala Phe Ser Thr Ser Ala Thr Arg Phe Thr Leu 
340 345 350 



25 



Tyr 



(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

3 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Ala He Gin Ala Pro Phe Leu Trp Val Leu Phe Leu Leu Ala Ala Leu 
35 l 5 10 15 

Glu Asn He Phe Val Leu Ser Val Phe Cys Leu His Lys Thr Asn Cys 
20 25 30 

Thr Val Ala Glu He Tyr Leu Gly Asn He Ala Ser Ala Asp Leu He 
35 40 45 

4 0 He Ala Cys Gly Leu Pro Phe Trp Ala He Thr He Ala Asn Asn Phe 

50 55 60 

Asp Trp Leu Phe Gly Glu Val Leu Cys Arg Val Val Asn Leu Tyr Met 
65 70 75 80 

Asn Leu Tyr Ser Ser He Cys Phe Leu Val Ser He Asp Arq Tyr Leu 
45 85 90 95 



Ala Leu Val Lys Thr Met Ser Asn Leu Arg Trp Ala Lys Leu Tyr Ser 
100 105 no 
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Leu Val He Trp Ser Cys Thr Leu Leu Leu Ser Ser Pro Met Leu Vai 
115 120 125 

Phe Arg Thr Met Tyr Arg Glu Glu Gly His Asn Val Thr Cys Val He 
130 135 140 

5 Val Tyr Pro Ser Arg Ser Trp Glu Val Phe Leu Leu Asn Leu Val Gly 

145 150 155 160 

Phe Leu Leu Pro Leu Ser He He Thr Phe Cys Thr Val Arg lie Met 
165 170 175 

Val Leu Arg Asn Asn Glu Met Lys Lys Phe Lys Glu Val Gin Thr Glu 
10 180 185 190 

Lys Lys Ala Thr Val Leu Val He Ala Val Leu Gly Leu Phe Val Leu 
195 200 205 

Cys Trp Phe Pro Phe Gin He Ser Thr Phe Leu Asp Thr Leu Leu Arg 

210 215 220 

15 Leu Gly Val Leu Ser Gly Cys Trp Asn Glu Arg Ala Val Asp He Val 

225 230 235 240 

Arg Gin He Ser Ser Tyr Val Ala Tyr Ser Asn Ser Cys Leu Asn Pro 
245 250 255 

Leu Val Tyr Val He Val Gly Lys Arg Phe Arg Lys Lys Ser Arg Glu 
20 260 265 270 

Val Tyr Gin Ala He Cys Arg Lys Gly Gly Cys Met Gly Glu Ser Val 
275 280 285 

Leu Asn Ser Met Gly Thr Leu Arg Thr Ser He Ser Val Asp Arg Gin 
290 295 300 

25 He His Lys Leu Gin Asp Trp Ala Gly Asn Lys Gin 

305 310 315 

(2) INFORMATION FOR SEQ ID NO: 47: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 7 amino acids 

3 0 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
35 He Leu Leu Val Val He He Cys Gly Leu Gly He Val Gl> Asn He 

1 r - 10 15 

Met Val Val Leu Val Val Met Arg Thr Thr Pro Thr Asn Cys Tyr Leu 
20 25 30 

Val Ser He Ala Val Ala Asp Leu Met Val Leu Val Ala Ala Gly Leu 
4° 35 40 45 

Pro Asn He Thr Asp Ser He Tyr Glv Ser Trp Val Tyr Gly Tyr Val 
SO 55 60 

Gly Cys Leu Cys ne rnr Tyr Leu Gin Tyr Leu Gly lie Asn Ala Ser 

fi 5 70 75 80 

4 5 Ser Cys Ser lie Thr Ala Phe Thr lie Glu Arg Tvr lie Ala lie Cys 

85 90 95 



His Pro He Lys Ala Gin Phe Leu Cys Thr Phe Ser Arg Ala Lys 



Lys 
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100 105 110 

lie lie He Phe Val Trp Ala Phe Thr Ser He Tyr Leu Phe Leu Leu 
115 120 125 

Asp He Asn He Ser Thr Tyr Lvs Asn Ala Val Val Val Ser Cys Gly 
5 130 135 140 

Tyr Lys He Ser Arg Asn Tyr Tyr Ser Pro He Tyr Leu Met Asp Phe 
145 150 155 160 

Gly Val Phe Tyr Val Val Pro Leu He Ala Thr Val Leu Tyr Gly Phe 
165 170 175 

10 lie Ala Arg He Leu Phe Leu Asn Pro lie Pro Ser Asp Pro Lys Glu 

180 185 190 

Asn Ser Lys Met Trp Lys Asn Asp Ser He His Gin Asn Lys Asn Leu 
195 200 205 

Asn Leu Asn Ala Ser Ser Arg Lys Gin Val Thr lie Asn Leu Ala Val 
15 210 215 ' 220 

Val Val He Leu Phe Ala Leu Leu Trp Asn Thr Tyr Arg Thr Leu Val 
225 230 235 240 

Val Val Asn Ser Phe Leu Ser Ser Pro Phe Gin Glu Asn Trp Lys Leu 
245 250 255 

2 0 Leu Lys Cys Arg lie Cys He Tyr Leu Asn Ser Ala He Asn Pro Val 

260 265 270 

He Tyr Asn He Met Ser Gin Lys Arg Phe Ala Ala Phe Arg Lys Leu 
275 280 285 

Cys Asn Cys Lys Gin Lys Pro Thr Glu Lys Ala Ala Asn Tyr Ser Val 
25 290 295 300 

Ala Leu Asn Tyr Ser Val lie Lys Glu Ser Asp Arg Phe Ser Thr Glu 
305 310 315 320 

Leu Glu Asp He Thr Val Thr Asp Thr Tyr Val Ser Thr Thr Lys Val 
325 330 335 

3 0 Ser Phe Asp Asp Thr Cys lie Ala Ser Glu Asn 

340 345 

(2) INFORMATION FOR SEQ ID NO: 48: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 amino acids 

3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(li) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

4 0 Leu Ala Leu Trp Ala Thr Ala Tyr Leu Ala Leu Val Leu Va- Ala Val 

1 5 10 15 

Thr Gly Asn Ala lie Val lie Trp lie 11 Leu Ala His Arg Arg Met 
20 25 30 

Arg Thr Val Thr Asn Tvr Phe lie Val Asn lie Ala Leu Ala Asd Leu 
45 35 40 45 

Leu Asn Ala Ala Phe Asn Phe Val Tyr Ala Ser His Asn He Trp Tyr 
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50 55 60 

Phe Gly Arg Ala Phe Cys Tyr Phe Gin Asn Leu Phe Pro He Thr Ala 
65 70 ^5 80 

Met Phe Val Ser He Tyr Ser Met Thr Ala He Ala Ala Asp Arg Tyr 
85 90 95 

Met Ala He Val His Pro Phe Gin Pro Arg Leu Ser Ala Pro Ser Thr 
100 105 110 

Lys Ala Val He Ala Gly He Trp Leu Val Ala He Lys Leu Ala Phe 
115 120 125 

Pro Gin Cys Phe Tyr Ser Thr Val Thr Met Gin Gly Ala Thr Lys Cys 
130 135 140 

Val Val Ala Trp Pro Glu Asp Ser Gly Gly Lys Thr Leu Leu Leu Tyr 
145 150 " 155 160 

His Leu Val Val He Ala Leu He Tyr Phe Leu Pro He Ala Leu Ala 
165 170 175 

Tyr Ser Val He Gly Leu Thr Leu Trp Arg Arg Ala Val Pro Gly His 
180 185 190 

Gin Ala His Gly Ala Asn Leu Arg His Leu Gin Ala Lys Lys Lys Phe 
195 200 205 

Val Lys Thr Met Val Leu Val Val Val Thr Phe Ala He Cys Trp Leu 
210 215 220 

Pro Tyr His Leu Tyr Phe He Leu Gly Ser Phe Gin Glu Asp He Tyr 
225 230 235 240 

Cys His Lys Phe He Gin Gin Val Tyr Leu Ala Leu Phe Trp Leu Ala 
25 245 250 255 

Met Ser Ser Thr Met Tyr Asn Pro He He Tyr Cys Cys Leu Asn His 
260 265 27^ 

Arg Phe Arg Ser Gly Phe Arg Leu Ala Phe Arg Cys Cys Pro Trp Val 
275 280 285 

3 0 Thr Pro Thr Lys Glu Asp Lys Leu Glu Leu Thr Pro Thr Thr Ser Leu 

290 295 300 

Ser Thr Arg Val Asn Arg Cys His Thr Lys Glu Thr Leu Phe Met Ala 
305 310 315 320 

Gly Asp Thr Ala Pro Ser Glu Ala Thr Ser Gly Glu Ala Gly Arg Pro 
35 325 330 335 

Gin Asp Gly Ser Gly 
340 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 34 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
di) MOLECULE TYPE: peptide 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

He Val Leu Trp Ala Ala Ala Tvr Thr Val He Val Val Arg Ser Val 

1 5 10 15 
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Val Gly Asn Val Val Val lie Trp He He Leu Ala His Lys Arg Met 
20 25 30 

Arg Thr Val Thr Ash Tyr Phe Leu Val Asn He Ala Phe Ala Phe Ala 
35 . 40 45 

5 Leu Asn Thr Trp Asn Phe Thr Tyr Ala Val His Asn Val Trp Tyr Tyr 

50 55 60 

Gly Leu Phe Tyr Cys Lys Phe His Asn Phe Phe Pro He Ala Ala Leu 
65 70 75 80 

Phe Ala Ser He Tyr Ser Met Thr Ala Val Ala Phe Asp Arg Tyr Leu 
10 85 90 95 

He He His Pro Leu Gin Pro Arg Leu Ser Ala Thr Ala Thr Lys Val 
100 105 110 

Val He Phe Val He Trp Val He Ala Leu Leu Leu Ala Ser Pro Gin 
115 120 125 

15 Gly Tyr Tyr Ser Thr Thr Glu Leu Ser Arg Val Val Cys Met He Glu 

130 135 140 

Trp Pro Glu His Pro Asn Arg Thr Tyr Glu Lys Ala Tyr Hi^ He Cys 
145 150 155 160 

Val Thr Val Leu He Tyr Phe Leu Pro Leu Leu Val He Gly Tyr Ala 
20 165 170 175 

Tyr Thr Val Val Gly He Thr Leu Trp Ala Ser Glu He Pro Gly Asp 
180 185 190 

Ser Ser Asp Arg Tyr His Glu Gin Val Ser Ala Lys Arg Lys Val Val 
195 200 205 

2 5 Lys Met He Cys Val Val Val Cys Thr Phe Ala He Cys Trp Leu Pro 

210 215 220 

Phe His Val Phe Phe Leu Leu Pro Tyr He Asn Pro Asp Leu Tyr Leu 
225 230 235 240 

Lys Lys Phe He Gin Gin Val Tyr He Ala Ser Met Trp Leu Ala Met 
30 245 250 255 

Ser Ser Thr Met Tyr Asn Pro He lie Tyr Cys Cys Leu Asn Asp Arg 
260 265 270 

Phe Arg Leu Gly Phe Lys His Ala Phe Arg Cys Cys Pro Phe He Ser 
275 280 285 

3 5 Ala Gly Asp Tyr Glu Gly Leu Glu Met He Lys Ser Thr Arg Tyr Leu 

290 295 300 

Gin Thr Leu Ser Ser Val Tyr Lys Val Ser Arg Leu Glu Thr Thr lie 
305 310 315 320 

Ser Thr Val Val Gly Ala His Glu Glu Glu Pro Glu Glu Gly Pro Lys 
40 325 330 335 

Ala Thr Pro Ser 
340 

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 336 ammo acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
5 He Ala Leu Trp Ser Leu Ala Tyr Gly Leu- Val Val Ala Val Ala Val 

15 10 15 

Phe Gly Asn Leu He Val He Trp lie He Leu Ala His Lys Arg Met 
20 25 30 

Arg Thr Val Thr Asn Tyr Phe Leu Val Asn Leu Ala Phe Ser Asp Ala 
10 35 40 45 

Ser Val Ala Ala Phe Asn Thr Leu lie Asn Phe He Tyr Gly Leu His 
50 55 60 

Ser Glu Trp Tyr Phe Gly Ala Asn Tyr Cys Arg Phe Gin Asn Phe Phe 
65 70 75 80 

!5 Pro He Thr Ala Val Phe Ala Ser He Tyr Ser Met Ala He Ala Val 

85 90 95 

Asp Arg Tyr Met Ala He He Asp Pro Leu Lys Pro Arg Leu Ser Ala 
100 105 110 

Thr Ala Thr Lys He Val He, Gly Ser He Trp He Leu Ala Phe Leu 
20 us 120 125 

Leu Ala Phe Pro Gin Cys Leu Tyr Ser Lys He Leu Gly Arg Thr Leu 
130 135 140 

Cys Tyr Val Trp Pro Glu Gly Pro Lys Gin His Phe Thr Tyr His He 
145 150 155 160 

2 5 He Val He He Leu Val Tyr Cys Phe Pro Leu Leu He Leu Thr Tyr 

165 170 175 

Thr He Val Gly He Thr Leu Trp Gly Gly Glu He Pro Gly Asp Thr 
180 185 190 

Cys Asp Lys Tyr His Glu Gin Leu Lys Ala Lys Arg Lvs Val Val Met 
30 195 200 205 

Asn He Val Val Val Thr Phe Ala lie Cys Trp Leu Pro Tyr His Val 
210 215 220 

Tyr Phe He Leu Thr Ala He Tyr Gin Gin Leu Asn Arg Trp Lys Tyr 
225 230 235 240 

3 5 He Gin Gin Val Tyr Leu Ala Ser Phe Trp Leu Ala Met Ser Ser Thr 

245 250 255 

Met Tyr Asn Pro He He Tyr Cys Cys Leu Asn Lys Arg Phe Arg Ala 
260 265 270 

Gly Phe Lys Arg Ala Phe Arg Trp Cys Pro Phe lie Gin Val Ser Se^ 
40 275 280 285 



Tyr Asp Glu Leu Glu Leu Ly s Thr Thr Arg Phe His Pro Thr hr- n r,i n 

; — iws 



45 



Ser Ser Leu Tyr Thr Val Ser Phe Met Ser Val Thr Val Leu Phe Asd 

305 310 315 320 

Pro Asn. Asp Gly Asp Pro Thr Lvs Ser Ser Ara Lvs Lvs Ara Ala Va 1 

325 330 ~ 335 
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(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 
-(B) TYPE: amino acid 
5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met He Pro Thr Leu Tyr Ser He He Phe Val Val Gly He Phe Gly 
10 1 5 10 15 

Asn Ser Leu Val Val He Val He Tyr Phe Tyr Met Lys Leu Lys Thr 
20 25 30 

Tyr Ala Ser Val Phe Leu Leu Asn Leu Ala Leu Ala Asp Leu Cys Phe 
35 40 45 

15 Leu Leu Thr Leu Pro Leu Trp Ala Val Tyr Thr Leu Tyr Arg Trp Pro 

50 55 60 

Phe Gly Asn Tyr Leu Cys Lys He Ala Ser Ala Ser Val Ser Phe Asn 
65 70 75 80 

Leu Tyr Ala Ser Val Phe Leu Leu Thr Cvs Leu Ser He Asp Arg Tyr 
20 85 90 95 

Leu Ala He Val His Pro Met Lys Ser Arg Leu Arg Arg Leu Val Ala 
100 105 110 

Lys Val Thr Cys He He He Trp Leu Leu Ala Gly He Ala Ser Leu 
115 120 125 

25 Pro Thr He He His Arg Asn Phe Phe He Glu Asn Thr Asn He Thr 

130 135 140 

Val Cys Ala Phe His Tyr Glu Ser Gin Asn Ser Thr Leu Pro Val Gly 
145 150 155 160 

Leu Gly Leu Thr Lys Asn He Leu Gly Phe Leu Phe Pro Phe Leu He 
30 165 170 175 

He Leu Thr Ser Tyr Thr Leu He Trp Lvs Thr Leu Lys Lys Ala Tyr 
180 185 iSj 

Glu He Gin Lys Asn Lys Pro Arg Lys Asd Asp He Phe Lys He He 
195 200 205 

35 He Ala He Val Leu Phe Phe Phe Phe Ser Trp Val Pro His Asn lie 

210 215 . 220 

Phe Thr Phe Met Val Leu He Gin Leu Glv Leu He Arg Asp Cys Lvs 
225 230 " 235 240 



40 



He Glu Asp He Val Asp Thr Ala Met Pro He Thr He Cys Leu Ala 
245 250 255 

Tyr Phe Gin Gin Asn Leu Asn Pro Leu Phe TVr Glv Phe Leu Gly Lys 
260 265 " * 270 

Lys Phe Lys Lys Tyr Phe Leu His Ala Leu Leu Lys Tyr He Pro Pro 
275 280 285 

4 5 Lys Ala Lys Ser His Ser Asn Leu Ser Thr Lys Met Ser Thr Leu Ser 

290 295 300 
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Tyr Arg Pro Ser Glu Gin Gly Asn Ser Ser Thr Lys Lys Pre Ala Pro 
305 310 315 320 

Cys lie Glu Val Glu 
325 

5 (2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

He Val His Trp Val He Met Ser He Ser Pro Val Gly Phe Val Glu 

15 10 15 

15 Asn Gly He Leu Leu Trp Phe Leu Cys Phe Phe Thr Val Tyr Thr His 

20 25 30 

Leu Ser He Ala Asp He Ser Leu Leu Phe Cys lie Phe He Leu Ser 
35 40 45 

lie Asp Tyr Ala Leu Asp Tyr Glu Leu Ser Ser Gly His Tyr Tyr Thr 
20 50 55 60 

lie Val Thr Leu Ser Val Thr Phe Leu Phe Gly Tyr Asn Thr Gly Leu 
65 70 75 80 

Tyr Leu Leu Thr Ala He Ser Val Glu Arg Cys Leu Ser Val Leu Tyr 
85 90 95 

25 Pro He Trp Tyr Arg Cys His Arg Pro Lys Tyr Gin Ser Ala Leu Val 

100 105 110 

Cys Ala Leu Leu Trp Ala Leu Ser Cys Leu Val Thr Thr Me^ Tyr Val 
115 120 125 

Met Cys lie Asp Arg Phe Glu Glu Ser His Ser Arg Asn Asp Cys Arg 
30 130 135 140 

Ala Val lie lie Phe He Ala lie Leu Ser Phe Leu Val Phe Thr Pro 
145 150 155 160 

Ser Val Ser Ser Thr He Leu Val Val Lys He Arg Lys Asn Thr Trp 
165 170 175 

3 5 Ala Ser His Ser Ser Lys Leu Tyr He Val He Met Val Thr lie lie 

180 185 190 

lie Phe Leu lie Phe Ala Met Pro Met Arg Leu Leu Tyr Leu Leu Tyr 
195 200 205 

Tyr Glu Tyr Trp Ser Thr Phe Glv Asn Leu His His lie Ser Leu Leu 

40 210 215 220 

Phe Ser Thr lie Asn Ser Ser Ala Asn Pro Phe lie Tyr Phe Phe Val 
225 230 235 240 

Hly Ser Ser Lys Lye Lya A rg Phe Lyb> Glu Sm mil Lys Val Val Leu 

245 250 255 

4 5 Thr Arg Ala Phe Lys Asp Qlu Met Gin Pro Arg Arg Gin Lvs Asp Asn 

260 265 270 

Cys Asn Thr Val Thr Val Glu Thr Val Val 
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275 280 

(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
tii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
10 Tyr Asp Phe Leu Arg Val Leu lie Trp Leu lie Asn lie Leu Ala He 

1 5 10 15 

Met Gly Asn Val Met Thr Leu Phe Val Leu Leu Thr Ser Arg Tyr Lys 
20 25 30 

Leu Thr Val Pro Arg Phe He Met Asn Leu Ser Phe Ala Asp Phe Cys 
15 35 40 45 

Met Leu Tyr Leu Leu Leu He Ala Ser Val Asp Ser Gin Thr Lys Gly 
50 55 60 

Gin Tyr Tyr Asn His Ala He Asp Trp Gin Thr Gly Ser Gly Cys Ser 
65 70 75 80 

2 0 Thr Ala Gly Phe Phe Thr Val Leu Ala Ser Glu Leu Ser Val Tyr Thr 

85 90 95 

Leu Thr Val He Thr Leu Glu Arg Trp His Thr He Thr Tyr Ala He 
100 105 HO 

„ His He Asp Gin Lys Leu Arg Leu Arg His Ala He Leu He Met Leu 

25 115 120 125 

Gly Gly Trp Leu Phe Ser Ser Leu He Ala Met Leu Pro Leu Val Cys 
130 135 140 

Val Ser Asn Tyr Met Lys Val Ser He Cys Leu Pro Met Val Glu Thr 
145 150 155 160 

Thr Leu Ser Gin Val Tyr He Leu Thr He Leu He Leu Asn Val Val 
165 170 175 

Ala Phe Leu He He Cys Ala Cys Tyr He Lys He Tyr Phe Ala Val 
180 185 19/ 

Arg Asn Pro Glu He Met Ala Thr Asn Lys Asp Thr Lys He Ala Leu 
35 195 200 205 

Ala He Leu He Phe Thr Asp Phe Thr Cys Met Pro He Ser Phe Phe 
210 215 220 

Ala He Ser Ala Ala Phe Lys Val Pro Leu He Val Thr Asn Ser Lys 
225 230 235 2 40 

40 Val L eu Leu Val Leu Phe Tyr Pro He Asn Ser Cys Ala Asn Pro Phe 

245 250 255 

Leu Tyr Ala lie Phe Thr Lys Thr Phe Gin Arg Asp Phe Phe He Leu 
260 265 270 

Ser L Y S phe Cys Cys Lys Arg Arg Ala Asp He Tyr Arg Ar<~ Lys Asd 
45 275 280 285 " 

Phe Ser Ala Tyr Thr Ser Asn Cys Lys Lvs Gly Phe Thr Gly Ser Asn 
290 295 * 300 



30 
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Lys Pro Ser Gin Ser Thr Leu Lys Leu Ser Thr Leu His Cy- Gin Glv 

305 310 315 * 320 

Thr Ala Leu Leu Asp Lys Arg Arg Tyr Thr Glu Cys 
325 330 

5 (2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Tyr Lys Phe Leu Arg lie Val Val Trp Phe Val Ser Leu Leu Ala Leu 
15 10 15 

15 Leu Gly Asn Val Phe Val Leu Leu lie Leu Leu Thr Ser His Tyr Lys 

20 25 30 

Leu Asn Val Pro Arg Phe lie Met Asn lie Ala Phe Ala Asp Phe Cys 
35 40 45 

Met Met Tyr Leu Leu Leu lie Ala Ser Val Asp Leu Tyr Thr His Ser 
20 50 55 60 

Glu Tyr Tyr Asn His Ala He Asp Trp Gin Thr Gly Pro Gly Cys Asn 
65 . 70 75 80 

Thr Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr Thr 
85 90 95 

2 5 Leu Thr Val He Thr Leu Glu Arg Trp Tyr Ala He Thr Phe Ala Met 

100 105 110 

Arg Leu Asp Arg Lys He Arg Leu Arg His Ala Cys Ala He Met Val 
115 120 125 

Gly Gly Trp Val Cys Cys Phe Leu Leu Ala Leu Leu Pro Leu Val Gly 
30 130 135 140 

He Ser Ser Tyr Ala Lys Val Ser He Cys Leu Pro Met Thr Glu Thr 
145 150 155 160 

Pro Leu Ala Leu Ala Tyr He Val Phe Val Leu Thr Leu Asn He Val 
155 170 175 

3 5 Ala Phe Val He Val Cys Cys Cys Tyr Val Lys He Tyr He Thr Val 

180 185 190 

Arg Asn Pro Gin Tyr Asn Pro Gly Asp Lys Asp Thr Lys He Ala Lys 
195 200 205 

Arg Met Ala Val Leu He Phe Thr Asp Phe He Cys Met Ala Pro lie 
40 210 215 220 

Ser Phe Tyr Ala Leu Ser Ala lie Leu Asn Lvs Pro Leu lie Thr Val 
225 230 235 240 

ser Asn Ser Lys lie Leu Leu Val Leu Phe Tyr Pro Leu Asn Ser Cys 
245 250 255 



45 



Ala Asn Pro Phe Leu Tyr Ala lie Phe Thr Lvs Ala Phe Gin Arg Asp 
260 265 * 270 
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Val Phe He Leu Leu Ser Lys Phe Gly He Cys Lys Arg Gin Ala Gin 
275 280 285 

Ala Tyr Arg Gly Gin Arg Val Pro Pro Lys Asn Ser Thr Asp He Gin 
290 295 300 • . 

5 Val Gin Lys Val Thr His Asp Met Arg Gin Gly Ala Leu Asn Met Glu 

305 310 315 320 

Asp Val Val Glu Leu He Glu Asn Ser His Leu Thr Pro Lys Lys Gin 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 55: 
10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
15 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Tyr Asn lie Leu Arg Val Leu lie Trp Phe lie Ser He Leu Ala He 
15 10 15 

Thr Gly Asn He He Val Leu Val He Leu Thr Thr Ser Gin Tyr Lys 
20 20 25 30 

Leu Thr Val Pro Arg Phe Leu Met Asn lie Ala Phe Ala Asp Leu Cys 
35 40 45 

He Gly lie Tyr Leu Leu Leu lie Ala Ser Val Asp He His Thr Lys 
50 55 60 

25 Ser Gin Tyr His Asn Tyr Ala lie Asp Trp Gin Arg Gly Ala Gly Cys 

6 5 70 75 80 

Asp Ala Ala Gly Phe Phe Thr Val Phe Ala Ser Glu Leu Ser Val Tyr 
85 90 95 

Thr Leu Thr Ala lie Thr Leu Glu Arg Trp His Thr He Thr His He 
30 100 105 110 

Met Gin He Asp Cys Lys Val Gin Leu Arg His Ala Ala Ser Val Met 
115 , 120 125 

Val Met Gly Trp He Phe Ala Phe Ala Ala Ala Leu Phe Pro lie Phe 
130 135 140 

3 5 Gly lie Ser Ser Tyr Met Lys Val Ser lie Cys Leu Pro Leu lie Asp 

1« 150 155 160 

Ser Pro Leu Ser Gin Leu Tyr Val Met Ser Leu Leu Val Leu Asn Val 
165 170 175 



40 
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Leu Ala Phe Val Val lie Cys Gly Cvs Tvr Thr His He Tyr Leu Thr 

1Q0 185 19^ 

Val Arg Asn Pro £sn lie Val Ser Ser Ser Ser Asp Thr Arg lie Ala 

195 200 205 

Lys Arg Met Leu lie Phe Thr Asp Phe Leu Leu Pro He Ser Phe Phe 

210 • 215 220 

Ala lie Ser Ala Ser Leu Lys Val Pro Leu lie Thr Val Ser Lvs Ala 

225 230 235 * 240 

Lys He Leu Leu Val Leu Phe His Pro lie Asn Ser Cys Ala Asn Pro 
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245 250 255 

Phe Leu Tyr Ala lie Phe Thr Lys Asn Phe Arg Arg Asp Phe Phe He 
260 265 270 

Leu Leu Ser Lys Cys Gly Cys Tyr Glu Met Gin Ala Gin lie Tyr Arg 
5 275 280 285 

Thr Glu Thr Ser Ser Thr Val His Asn Thr His Pro Arg Asn Gly His 
290 295 300 

Cys Ser Ser Ala Pro Arg Val Thr Ser Gly Ser Ser Arg Tyr He Leu 
305 310 315 320 

10 Val Pro Leu Ser Leu Gin Asn 

325 

(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
20 Ser Met Leu Ala Ala Tyr Met Phe Leu Leu He Val Leu Gly Phe Pro 

15 10 15 

He Asn Phe Leu Thr Leu Tyr Val Thr Val Gin His Lys Lys Leu Arg 
20 25 30 

Thr Pro He Asn Tyr lie Leu Leu Asn Leu Ala Val Ala Asp Leu Phe 
25 35 40 45 

Met Val Leu Gly Gly Phe Thr Ser Thr Leu Tyr Thr Ser Leu His Gly 
50 55 60 

Tyr Phe Val Phe Gly Pro Thr Gly Cys Asn Leu Glu Gly Ph*> Phe Ala 
65 70 75 80 

30 Thr Leu Gly Gly Clu He Ala Leu Trp Ser Leu Trp Leu Ala He Glu 

85 90 95 

Arg Tyr Val Val Val Cys Lys Pro Met Ser Asn Phe Arg Phe Gly Glu 
100 105 110 

Asn His Ala He Met Gly Val Ala Phe Thr Trp Val Met Ala Leu Ala 
35 115 120 125 

Cys Ala Ala Pro Pro lie Ala Gly Trp Ser Arg Tyr He Pro Glu Gly 
130 135 " 140 

Leu Gin Cys Ser Cys Gly lie Asp Tyr Tyr Thr Leu Lys Pro Glu Val 
145 150 155 160 

4 0 Asn Asn Glu Ser Phe Val He Tvr Met Phe Val Val His Phe Thr lie 

165 " 170 175 

Pro Leu lie lie Phe Phe Cys Tyr Gly Gin Leu Val Phe Thr Val Lvs 
180 185 , 190 

Glu Ala Ala Ala Gin Gin Gin Glu Ser Ala Thr Thr Gin Lys Ala Glu 
45 195 200 205 



Lvs Glu Val Thr Arg Met Val He lie Met Val lie Ala Phr; Leu He 
210 215 220 
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10 

(2) 

15 



20 



25 



30 



35 



40 



Cys Trp Val Pro Tyr Ala Ser Val Ala Phe Tyr lie Phe Thr His Gin 
225 230 235 240 

Gly Ser Asn Phe Gly Pro lie Phe Met Arg lie Pro Ala Phe Phe Ala 
245 250 255 

Lys Ser Ala Ala He Tyr Asn Pro Val He Tyr He He Phe Asn Lys 
260 265 270 

Gin Phe Arg Asn Cys Met Leu Gin Leu He Cys Cys Gly Lys Asn Pro 
275 280 285 

Leu Gly Asp Asp Glu Ala Ser Ala Thr Val Ser Lys Arg Glu Thr Ser 
290 295 300 

Gin Val Ala Pro Ala 



INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57 : 

Met He Phe Val Val He Ala Ser Val Phe Thr Asn Gly Leu Val Leu 
15 10 15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Pro His Pro He Asn Trp He 
20 25 30 

Leu Val Asn Leu Ala Val Ala Asp He Ala Gly Thr Val He Ala Ser 
35 40 45 

Thr He Ser Val Val Asn Gin Val Tyr Gly Tyr Phe Val Leu Gly His 
50 55 60 

Pro Met Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly He Thr 
65 70 75 80 

Gly Leu Trp Ser Leu Ala He lie Ser Trp Glu Arg Trp Met Val Val 
85 90 95 

Cys Lys Pro Phe Gly Asn Val Arg Phe Asp Ala Lys He Ala lie Val 
100 105 HO 

Gly He Ala Phe Ser Trp He Trp Ala Ala Val Trp Thr Ala Pro Pro 
115 120 125 

He Phe Gly Trp Ser Arg Tyr Trp Pro His Gly Leu Lys Thr Ser Cys 
130 135 140 

Gly Pro Asp Val Phe Ser Gly Ser Ser Tyr Pro Gly Val Gin Ser Leu 
14 5 150 155 160 

Leu Cys He Thr Pro Leu Ser He He Val Leu Cys Tyr Leu Gin Val 
165 170 175 

Trp Thr Ala He Arg Ala Val Ala Lys Gin Gin Lys Glu Ser Glu Ser 
180 185 190 

Thr Gin Lys Ala Glu Lys Glu Val Thr Arg Met Trp Val Met Val Leu 
195 200 205 

Ala Phe Cys Phe Cys Trp Gly Pro Tyr Ala Phe Phe Ala Cys Phe Ala 
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210 215 220 

Ala Ala Asn Pro Gly Tyr Pro Phe His Pro Leu Met Ala Ala Leu Pro 
225 230 235 240 

Ala Phe Phe Ala Lys Ser Ala Thr He Tvr Asn Pro Val He Tyr Val 
5 ?AS 250 255 

Phe Met Asn Arg Gin Phe Arg Asn Cys He Leu Gin Leu Phe Gly Lys 
260 265 270 

Lys Val Asp Asp Gly Ser Glu Leu Ser Ser Ala Ser Lys Thr Glu Val 
275 280 285 

10 Ser Ser Val Ser Ser Val Ser Pro Ala 

290 295 

(2) INFORMATION FOR SEQ ID NO: 58: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 
15 (B) TYPE: amino acid 

(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

2 0 Arg Cys Phe Val Val Thr Ala Ser Val Phe Thr Asn Gly Leu Val Leu 

15 10 15 

Ala Ala Thr Met Lys Phe Lys Lys Leu Arg His Pro Leu Asn Trp He 
20 25 30 

Leu Val Asn He Ala Val Ala Asp He Ala Gly Thr Val He Ala Ser 
25 35 40 45 

Thr He Ser He Val Asn Gin Val Ser Gly Tyr Phe Val Leu Gly His 
50 55 60 

Pro Met Cys Val Leu Glu Gly Tyr Thr Val Ser Leu Cys Gly lie Thr 
65 70 75 80 

3 0 Gly Leu Trp Ser Leu Ala He He Ser Trp Glu Arg Trp Leu Trp Cys 

85 90 95 

Lys Pro Phe Gly Asn Val Arg Phe Asp Ala Lys He Ala He Val Gly 
100 105 110 

He Ala Phe Ser Trp He Trp Ser Ala Val Trp Thr Ala Pro Pro He 
35 115 120 125 

Phe Gly Trp Ser Arg Tyr Trp Pro His Gly Leu Lys Thr Ser Cys Gly 
130 135 140 

Pro Asp Val Phe Ser Gly Ser Ser Tyr Pro Gly Val Gin Ser Leu Val 
145 150 155 160 

4 0 lie Met Val Thr Cys Cys lie lie Pro lie Ala He lie Leu Cys Tyr 

165 170 175 

Leu Gin Val Trp Leu Ala lie Arg Ala Val Ala Lys Gin Gin Lys Glu 
180 18iL 190 

Ser Glu Ser Thr Gin Lys Ala Glu Lys Glu Val Thr Arg Met Leu Phe 
45 195 200 205 

Ala Tvr Cvs Val Cvs Trp Gly Pro Tvr Thr Phe Phe Ala Cys Phe Ala 
210 215 220 
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Ala Ala Asn Pro Gly Tyr Ala Phe His Pro Leu Met Ala Ala Leu Pro 
225 230 235 240 

Ala Tyr Phe Ala Lys Ser Ala Thr II Tyr Asn Pro Val He Tyr Val 
245 250 255 

5 Phe Met Asn Arg Gin Phe Arg Asn Cys He Leu Gin Leu Phe Gly Lys 

260 265 270 

Lys Val Asp Asp Gly Ser Glu Leu Ser Ser Ala Ser Lys Thr Glu Val 
275 280 285 

Ser Ser Val Ser Ser Val Ser Pro Ala 
10 290 295 

(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

Gin Ala Ala Phe Met Gly Thr Val Phe Leu He Gly Phe Pro Leu Leu 
20 1 5 10 15 

Val Ala Thr Leu Ala Tyr Lys Lys Leu Arg Gin Pro Asn Tyr He Leu 
20 25 30 

Val Asn Val Ser Phe Gly Gly Phe Leu Leu Cys He Phe Ser Val Phe 
35 40 45 

25 Pro Val Phe Val Ala Ser Cys Asn Gly Tyr Phe Val Phe Gly Arg His 

50 55 60 

Val Cys Ala Leu Glu Gly Phe Leu Gly Thr Val Ala Gly Leu Val Thr 
65 70 75 80 

Gly Trp Ser Leu Ala Phe Leu Ala Phe Glu Arg Tyr He Val He Cys 
30 85 90 95 

Lys Pro Phe Gly Asn Phe Arg Phe Ser Ser Lys His Ala Leu Thr Val 
100 105 110 

Val He Ala Thr Trp Thr He Gly He Gly Val Ser He Pro Pro Phe 
115 120 125 

35 Phe Gly Trp Ser Arg Phe He Pro Glu Gly Leu Gin Cys Ser Cys Gly 

130 135 140 

Pro Asp Lys Tyr Thr Val Gly Thr Lys Tyr Arg Ser Glu Ser Tyr Thr 
145 150 155 160 

Trp Phe Leu Phe He Phe Cys Phe He Val Pro Leu Ser Leu He Cys 
40 165 170 175 

Phe Ser Tyr Thr Gin Leu Leu Arg Ala Leu Lys Ala Val Ala Ala Gin 
180 185 190 

Gin Gin Glu Ser Ala Thr Thr Gin Lys Ala Glu Arg Glu Val Ser Arg 
195 200 205 

45 Met Val Val Val Met Val Gly Ser Phe Cys Val Cys Tyr Val Pro Tyr 

210 215 220 

Ala Ala Phe Ala Met Tyr Met Val Asn Asn Arg Asn His Gly Leu Asp 
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225 230 235 240 

Leu Arg Leu Val Arg lie Pro Ser Phe Phe Ser Lys Ser Ala Cys lie 
245 250 255 

Tyr Asn Pro lie lie Tyr Cys Phe Met Asn Lys Gin Phe Gin Ala Cys 
5 260 265 270 

lie Met Met Val Cys Gly Lys Ala Met Met Glu Ser Asp Thr Cys Ser 
275 280 285 

Ser Gin Lys Thr Glu Val Ser Thr Val Ser Ser Thr Gin Val Gly Pro 
290 295 300 

10 Asn 
305 

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 293 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

2 0 Leu He Tyr Gly Leu Phe Leu Ser Met Tyr Leu Val Thr Val He Gly 

15 10 15 

Asn He Ser He He Val Ala lie He Ser Asp Pro Cys Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp He Cys Phe 
25 35 40 45 

He Ser Thr Thr Val Pro Val Asn Thr Gin Thr Gin Asn Asn Val He 
50 55 60 

Thr Tyr Ala Gly Cys He Thr Gin He Tyr Phe Phe Leu Leu Phe Val 
65 70 75 80 

3 0 Glu Leu Asp Asn Phe Leu Leu Thr He Met Ala Tyr Asp Arg Tyr Val 

85 90 95 

Ala He Cys His Pro Met His Tyr Thr Val He Met Asn Tyr Lys Leu 
100 105 110 

Cys Gly Phe Leu Val Leu Val Ser Trp He Val Ser Val Leu His Ala 
35 115 120 125 

Leu Phe Gin Ser Leu Ala Leu Pro Phe Cys Thr His Leu Glu He Pro 
130 135 140 

His Tyr Phe Cys Clu Pro Asn Gin Val He Gin Leu Thr Cys Ser Asp 
145 150 155 160 

4 0 Ala Phe Leu Asn Asp Leu Val He Tyr Phe Thr Leu Val Leu Leu Ala 

165 170 175 

Thr Val Pro He Ala Gly He Phe Tyr Ser Tyr Phe Ala He Ser Ser 
180 m 185 13 

Val His Gly Lys Tyr Lys Ala Phe Ser Thr Cys Ala Ser His Leu Ser 
45 195 200 205 

Val Val Ser Leu Phe Tyr Cys Thr Gly Leu Glv Val Tyr Leu Ser Ser 
210 215 * 220 
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Ala Ala Asn Asn Ser Leu Ser Ala Thr Ala Ser Val Met Tyr Thr Val 
225 230 235 240 

Val Thr Pro Met Val Asn Pro Phe lie Tyr Ser Leu Arg Asn Lys Asp 
245 250 255 

5 Val Lys Ser Val' Leu Lys Lys Thr Leu Cys Glu Glu Val lie Arg Ser 

260 265 270 

Pro Pro Ser Leu Leu His Phe Phe Leu Val Leu Cys His Leu Pro Cys 
275 280 285 

Phe lie Phe Cys Tyr 
10 290 

(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 84 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Leu Phe Leu Leu Phe Leu lie Met Tyr Leu Ala Thr Val Leu Gly 
20 1 5 10 15 

Asn Leu Leu lie lie Leu Ala lie Gly Gly Asp Ser Arg Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Asn Leu Ser Phe Val Asp Val Cys Phe 
35 40 45 

25 Ser Ser Thr Thr Val Pro Lys Val Leu Ala Asn His He Leu Gly Ser 

50 55 60 

Gin Ala He Ser Phe Ser Gly Cys Leu Thr Gin Leu Tyr Phe Leu Ala 
65 70 75 80 

Val Phe Gly Asn Met Asp Asn Phe Leu Leu Ala Val Met Ser Tyr Asp 
30 85 90 95 

Arg Tyr Val Ala He Cys His Pro Leu His Tyr Thr Thr He Arg Gin 
100 105 110 

Leu Cys Val Leu Leu Val Val Gly Ser Trp Val Val Ala Asn Met Asn- 
115 120 125 

3 5 Cys Leu Leu His He Leu lie Met Ala Arg Lys Ser Phe Cys Ala Asp 

130 135 140 

Leu Pro His Phe Phe Cys Asp Gly Thr Pro Leu Leu Lys Leu Ser Cys 
145 150 155 160 

Ser Asp Thr His Leu Asn Glu Leu Met He Leu Thr Glu Gly Ala Val 
40 165 170 175 

Val Met Val Thr Pro Phe Val Cys He Leu He Ser Tyr He His He 
180 185 * 190 

Thr Cys Ala Val Leu Arg Val Ser Ser Pro Arg Gly Gly Trp Lys Ser 
195 200 205 

4 5 Phe Ser Thr Cys Cly Ser His He Ala Val Val Cys Leu Phe Tyr Glv 

210 215 220 

Thr Val He Ala Val Tyr Phe Asn Pro Ser Ser Ser His Leu Ala Glv 
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225 230 235 240 

Arg Asp Met Ala Ala Ala Val Met Tyr Ala Val Val Thr Pro Met lie 
245 250 255 

Asn Pro Phe lie Tyr Ser Leu Arg Asn Ser Asp Met Lys Ala Ala Leu 
5 260 265 270 

Arg Lys Val Leu Ala Met Arg Phe Pro Ser Lys Gin 
275 280 

(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Leu Leu Phe Leu Leu Phe Leu Val Met Tvr Leu Leu Thr Val Val Gly 
1 5 10 15 

Asn Leu Ala lie lie Ser Leu Val Gly Ala His Arg Cys Leu Gin Pro 
20 25 30 

2 0 His Thr Pro Met Tyr Phe Phe Leu Cys Asn Leu Ser Phe Leu Glu lie 

35 40 45 

Trp Phe Thr Thr Ala Cys Val Pro Lys Thr Leu Ala Thr Phe Ala Pro 
50 55 60 

Arg Gly Gly Val He Ser Leu Ala Gly Cys Ala Thr Lys Tyr Phe Val 
25 65 70 75 80 

Phe Ser Leu Gly Cys Thr Glu Tyr Phe Leu Leu Ala Val Met Ala Tyr 
85 90 95 

Asp Arg Tyr Leu Ala He Cys Leu Pro Leu Arg Tyr Gly Gly He Met 
100 105 110 

3 0 Arg Pro Gly He Ala Met Arg Leu Ala Leu Gly Ser Trp Leu Cys Gly 

115 120 125 

Phe Ser Ala He Thr Val Pro Ala Thr Leu He Ala Arg Leu Ser Phe 
130 135 140 

Cys Gly Ser Arg Val He Asn His Phe Phe Cys Asp lie Ser Pro Trp 
35 145 150 155 160 

He Val Leu Ser Cys Thr Asp Thr Gin Val Val Glu Leu Val Ser Phe 
165 170 175 

Gly He Ala Phe Cys Val He Leu Gly Ser Cys Gly He Thr Leu Val 
180 185 190 

4 0 Ser Tyr Ala Lys lie Pro Ser Ala Arg Gly Arg His Arg Ala Phe Ser 

195 200 205 

Thr Cys Ser Ser His Leu Thr Val Val Leu Hp Trp Tyr my c^ r *fhr 
' ZTu ' \ 215 ~ 220 

He Phe Leu His Val Arg Thr Ser Val Glu Ser Ser Leu Asp Leu Thr 
45 225 230 235 240 

Lys Ala .He Thr Val Leu Asn Thr lie Val Thr Pro Val Leu Asn Pro 
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245 250 255 

Phe lie Tyr Thr Leu Arg Asn Lvs Asp Val Lys Glu Ala Leu Arg Arg 
260 " 265 270 

Thr Val Lys Gly Lys 
5 275 

(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: single 

<D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu lie Phe Ala Leu Phe Leu Ser Met Tyr Leu Val Thr Val Leu Glv 

15 1 5 10 15 

Asn Leu Leu lie lie Met Ala He He Thr Gin Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Phe Phe Leu Ser Phe Val Asp He Cys Phe Thr Ser Thr 
35 40 45 

20 Thr He Pro Leu Val Asn He Tyr Thr Gin Ser Lys Ser He Thr Tyr 

50 55 60 

Glu Asp Cys He Ser Leu Val Phe Ala Glu Leu Gly Asn Phe Leu Leu 
65 70 75 80 

Ala Val Met Ala Tyr Asp Arg Tyr Val Ala Xaa Cys His Pro Leu Cys 
25 85 90 95 

Tyr Thr Val He Val Asn His Arg Leu Cys He Leu Leu Le- Leu Leu 
100 105 110 

Ser Trp Val He Ser He Phe Arg Ala Phe He Gin Ser Leu He Val 
115 120 125 

30 Leu Gin Leu Thr Phe Cys Gly Asp Val Lys He Pro His Phe Phe Cys 

130 135 140 

Glu Leu Asn Gin Leu Ser Gin Leu Thr Cys Ser Asp Asn Phe Pro Ser 
145 150 155 160 

His Leu He Met Asn Leu Val Pro Val Met Leu Ala Ala He Ser Phe 
35 165 170 175 

Ser Gly He Leu Tyr Ser Tyr Phe Ser He Ser Thr Val Gin Gly Lys 
180 185 190 

Tyr Lys Ala Phe Ser Thr Cys Ala Ser His Leu Ser He Val Ser Leu 
195 200 205 

4 0 Phe Tyr Ser Thr Gly Leu Glv Val Tyr Val Ser Ser Ala Val Val Gin 

210 215 220 

Ser Ser His Ser Ala Ala Ser Ala Ser Val Met Tyr Thr Val Val Pro 
225 230 235 240 

Met Leu Asn Pro Phe He Tyr Ser Leu Arg Asn Lvs Asp Va* Lys Arg 
45 245 250 " 255 

Ala Leu Glu Arg Leu Leu Glu Gly Asn Cvs Lvs Val His His Trp Thr 
260 265 * * 270 
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Gly 

(2) INFORMATION FOR SEQ ID NO : 64 : 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGT^: 2 69 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Leu Phe Tyr Ala Leu Phe Leu Val Met Tyr Leu Thr Thr lie Leu Gly 
15 10 15 

Asn Leu Leu lie lie Val Leu Val Gin Leu Asp Ser Gin Leu His Thr 
20 25 30 

15 Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 

35 40 45 

Ser Ser Leu Lys Leu Leu Gin Asn Met Arg Ser Gin Asp Thr Ser lie 
50 55 60 

Pro Tyr Gly Gly Cys Leu Ala Gin Thr Tyr Phe Phe Met Val Phe Gly 
20 65 70 75 80 

Asp Leu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val Ala 
85 90 95 

lie Cys Phe Leu Pro His Tyr Thr Ser lie Met Ser Pro Lys Leu Cys 
100 105 110 

25 Thr Cys Leu Val Leu Leu Leu Trp Met Leu Thr Thr Ser His Met Met 

115 120 125 

Thr Leu Leu Ala Ala Arg Leu Ser Phe Cys Glu Asn Asn Trp Leu Asn 
130 135 140 

Phe Phe Cys Asp Leu Phe Val Leu Leu Lys lie Ala Cys Ser Asp Thr 
30 145 150 155 160 

Tyr lie Asn Glu Leu Phe lie Met Ser Thr Leu Leu lie lie lie Pro 
165 170 175 

Phe Phe Leu lie Val Met Ser Tyr Ala Lys Val Pro Ser Thr. Gin Gly 
180 185 190 

35 lie Cys Lys Val Phe Ser Thr Cys Gly. Ser His Leu Ser Val Val Ser 

195 200 205 

Leu Phe Tyr Gly Thr lie lie Gly Leu Tyr Leu Cys Pro Ala Gly Asn 
210 215 220 

Asn Ser Thr Val Lys Glu Met Val Met Ala Met Met Tyr Thr Val Val 
40 225 230 235 240 

Thr Pro Met lie Asn Pro Phe lie Tyr Ser Leu Arg Asn Arg Asp Leu 
245 250 255 

Arg Ala Leu He A rg Val He Cys Ser Met He Thr Leu 

■ 265" " 



4 5 (2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 86 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
5 Leu Leu Phe Phe Leu Ser Leu Leu Xaa Tyr Val Leu Val Leu Thr Glu 

15 10 15 

Asn Met Leu lie He He Ala He Arg Asn His Pro Thr Leu His Lys 

20 25 30 

Pro Met Tyr Phe Phe Leu Phe Leu Glu He Trp Tyr Val Thr Val Thr 
10 35 40 45 

He Pro Lys Leu Met Gly Phe He Gly Ser Lys Glu Asn His Gly Gin 
50 55 60 

Leu He Ser Phe Phe Ala Cys Met Thr Gin Leu Tyr Phe Phe Leu Gly 
65 70 75 80 

15 Leu Gly Cys Thr Glu Cys Val Leu Leu Ala Val Met Ala Tyr Asp Arg 

85 90 95 

Tyr Val Ala He Cys His Pro Leu His Tyr Pro Val He Val Ser Ser 
100 105 110 

Arg He Glx Val Leu Gly Ser Trp Ala Gly Gly Phe Gly He Ser Met 
20 115 120 125 

Val Lys Val Phe Leu He Ser Arg Leu Ser Tyr Cys Gly Pro Asn Thr 
130 135 140 

He Asn His Phe Phe Cys Asp Val Ser Pro Leu Leu Asn Leu Ser Cys 
145 150 155 160 

25 Thr Asp Met Ser Thr Ala Glu Leu Thr Asp Phe Val He Ala He Phe 

165 170 175 

He Leu Leu Gly Pro Leu Ser Val Thr Gly Ala Ser Tyr Met Arg He 
180 185 190 

Pro Ser Ala Ala Gly ^9 His L V S Pn e Ser Thr Cys Ala Ser His 
30 195 200 205 

Leu Thr Val Val He He Phe Tyr Ala Ala Ser He Phe He Tyr Ala 
210 215 220 

Arg Pro Lys Ala Leu Ser Ala Phe Thr Asp Asn Lys Leu Va.-. Ser Val 
225 230 235 240 

3 5 Leu Tyr Ala Val He Val Pro Leu Phe Asn Pro He He Tyr Cys Leu 

245 250 255 

Arg Asn Gin Asp Val Lys Arg Ala Leu Arg Arg Thr Leu His Leu Ala 
260 265 270 

Gin Asp Gin Glu Ala Asn Thr Asn Lys Gly Ser Lys He Gly 
40 275 280 285 



(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 66: 

Leu Phe Phe Ala Leu Phe Leu lie Met Tyr Leu Thr Thr Phe Leu Gly 

15 10 15 

Asn Leu Leu lie Val Val Leu Val Gin Leu Asp Ser His Leu His Thr 

5 20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe 
35 40 45 

Ser Ser Val Thr Met Leu Lys Leu Leu Gin Asn lie Gin Ser Gin Val 
50 55 60 

10 Pro Ser lie Ser Tyr Ala Gly Cys Leu Trp He Phe Phe Phe Leu Leu 

65 70 75 80 

Phe Gly Tyr Leu Gly Asn Phe Leu Leu Val Ala Met Ala Tyr Asp Arg 
85 90 95 

Tyr Val Ala He Cys Phe Pro Leu His Tyr Thr Asn He Met Ser His 

15 100 105 110 

Lys Leu Cys Thr Cys Leu Leu Leu Val Phe Trp He Met Arg Ser Ser 
115 120 125 

His Ala Met Met He Thr Leu He Ala Ala Arg Leu Ser Phe Cys Glu 
130 135 140 

2 0 Asn Asn Val Leu Leu Asn Phe Phe Cys Asp Leu Phe Val Leu Leu Lys 

145 150 155 160 

Leu Ala Cys Ser Asp Thr Tyr Val Asn Glu Leu Met He His He Met 
.55 170 175 

Glu Val He He He Val He Pro Phe Val Leu He Val He Ser Tyr 
25 180 185 190 

Ala Lys Val Pro Ser Thr Gin Ser He His Lys Val Phe Ser Thr Cys 
195 200 205 

Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr He He Gly 
210 215 220 

3 0 Leu Tyr Leu Cys Pro Ser Gly Asp Asn Phe Ser Leu Lys Gly Ser Leu 

225 230 235 240 

Thr Val Val Thr Pro He Met Pro Phe He Tyr Ser Leu Arg Asn Arg 
245 250 255 

Asp Met Lys Gin Ala Leu He Arg Val Thr Cys Ser Lys Lys lie Ser 

35 260 265 270 

Leu Pro Trp 
275 

(2) INFORMATION FOR SEQ ID NO: 67: 
(l) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 2 84 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(m TOPnT,nr,v- iin fla r 

(li) MOLECULE TYPE: peptide 

4 5 (xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Leu Phe Tyr Ala Leu Phe Leu Ala Met Tyr Leu Thr Thr Leu Leu Gly 

1 5 10 15 
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Asn Leu lie lie lie lie Leu lie Leu Leu Asp Ser His Leu His Thr 
20 25 30 

Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ala Asp Leu Cys Phe 
35 40 45 

5 Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 

50 55 60 

Pro Tyr Ala Gly Cys Leu Ala Gin He Tyr Phe Phe Leu Phe Phe Gly 
65 70 75 80 

Asp Leu Gly Asn Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 
10 85 90 95 

Ala He Cys Phe Pro Leu His Tyr Met Ser He Met Ser Pro Lys He 
100 105 no 

Glx Val Ser Leu Val Val Leu Ser Trp Val Leu Thr Thr Phe His Ala 
115 120 125 

15 Met Leu His Thr Leu He Met Ala Arg Leu Ser Phe Cys Gl*.. Asp Ser 

130 135 140 

Val He Pro His Tyr Phe Cys Asp Met Ser Thr Leu Leu Lys Val Ala 
145 150 155 160 

Cys Ser Asp Thr His Asp Asn Glu Leu Ala He Phe He Leu Gly Gly 
20 165 170 175 

Pro He Val Val Leu Pro Phe Leu Leu He He Val Ser Tyr Ala Arg 
180 185 190 

He Val Ser Ser He Phe Lys Val Pro Ser Ser Gin Ser He His Lys 
195 200 205 

25 Ala Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr 

210 215 220 

Gly Thr Val He Gly Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Glu 
225 230 235 240 

Val Lys Glu Thr Val Met Ser He Tyr Thr Met Val Pro Met Leu Asn 

30 245 250 255 

Pro Phe He Tyr Ser Leu Arg Asn Arg Asp He Lys Asp Ala Leu Glu 
260 265 270 

Lys He Met Cys Lys Lys Gin He Pro Ser Phe Leu 
275 280 

3 5 (2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
40 (D) TOPOLOGY: linear 

(iii MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 68: 

Leu Phe Tyr Ala Leu Phe Leu Ala Met Tvr Leu Thr He He Leu Gly 
15 10 15 

45 Asn Leu Leu II He Val Leu Val Arg Leu Asd Ser His Leu His Met 

20 25 30 
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Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cys Phe Ser Ser 
35 40 45 

Val Thr Trp Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser He 
50 55 60 

5 Ser Tyr Thr Gly Cys Leu Thr Gin Leu Tyr Phe Phe Met Val Phe Gly 

65 70 ' 75 80* 

Asp Trp Ser Phe Leu Leu Val Val Met Ala Tyr Asp Arg Tvr Val Ala 
85 90 * 95 

He Cys Phe Pro Leu Arg Tyr Thr Thr He Met Ser Thr Lys Phe Cys 
10 100 105 110 

Ala Ser Leu Val Leu Leu Leu Trp Met Leu Thr Met Arg His Ala Leu 
115 120 125 

Leu His Thr Leu Leu He Ala Arg Leu Ser Phe Cvs Glu Asp Ser Val 
130 135 140 

15 lie Leu His Phe Phe Cys Asp He Ser Ala Leu Leu Lys Leu Ser Cys 

145 150 155 160 

Ser Asp He Tyr Val Asn Glu Leu Met He Tyr He Leu Gly Gly Leu 
165 170 175 

He He He He Pro Phe Leu Leu He Val Met Ser Tyr Val Arg He 
20 180 185 190 

Phe Phe Ser He Leu Lys Phe Pro Ser He Gin Asp He Ty- Lvs Val 
195 200 205 

Phe Ser Thr Cys Gly Ser His Leu Ser Val Val Thr Leu Phe Tyr Gly 
210 215 220 

2 5 Thr He Phe Gly He Tyr Leu Cys Pro Ser Gly Asn Asn Ser Thr Val 

225 230 235 240 

Lys Glu He Leu Thr Val Val Thr Pro Met He Asn Pro Phe He Tyr 
245 250 255 

Ser Leu Arg Asn Arg Asp Trp Arg Ala Leu lie Arg Val He Cys Thr 
30 260 265 270 

Lys Lys lie Ser Leu 
275 

(2) INFORMATION FOR SEQ ID NO: 69: 
(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH : 2 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 
MOLECULE TYPE: peptide 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Val Phe Tyr Ala Leu Phe Leu Ser Met Tyr Leu Thr He Vai Leu Gly 
1 5 10 15 

Asn Leu He He lie lie Leu He His Leu Asp Ser His Leu His Thr 
20 25 30 

4 5 Pro Met Tyr Leu Phe Leu Ser Asn Leu Ser Phe Ser Asp Leu Cvs Phe 

35 40 45 
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Ser Ser Leu Lys Leu Leu Gin Asn Met Gin Ser Gin Val Pro Ser lie 
50 55 60 

Pro Phe Ala Gly Cys Leu Thr Gin Leu Tyr Phe Tyr Leu Tyr Phe Ala 
65 70 75 80 

5 Asp Leu Glu Ser Phe Leu Leu Val Ala Met Ala Tyr Asp Arg Tyr Val 

65 90 95 

Ala lie Cys Phe Pro Leu His Tyr Met Ser lie Met Ser Pro Lys Leu 
100 105 110 

Cys Val Ser Leu Trp Leu Ser Trp Val Leu Thr Thr Phe His Ala Met 
10 115 120 125 

Leu His Thr Leu He Met Ala Arg Leu Ser Phe Cys Ala Asp Leu Pro 
130 135 140 

His Phe Phe Cys Asp He Ser Pro Leu Leu Lys Leu Ser Cys Ser Asp 
145 150 155 160 

15 Thr His Val Asn Glu Leu Val He Phe Leu Gly Leu Val He Val He 

165 170 175 

Pro Phe Val Leu He He Val Ser Tyr Ala Arg Val Val Ala Ser He 
180 185 190 

Leu Lys Val Pro Ser Val Arg Gly He His Lys He Phe Ser Thr Cys 
20 195 200 205 

Gly Ser His Leu Ser Val Val Ser Leu Phe Tyr Gly Thr He He Gly 
210 215 220 

Leu Tyr Leu Cys Pro Ser Ala Asn Asn Ser Thr Val Lys Glu Thr Leu 
225 230 235 240 

2 5 Thr Val Val Thr Pro Leu Pro Phe He Tyr Ser Leu Arg Asn Arg Asp 

245 250 255 

Met Lys Glu Ala Leu He Arg Val Leu Cys Lys Lys Lys He Thr Phe 
260 265 270 



30 



Cys Leu 



(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids 

(B) TYPE: amino acid 

3 5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Ala He Ala Val Leu Ser Leu Thr Leu Leu Gly Thr Phe Thr Val 

40 1 5 10 15 

Leu Glu Asn Leu Leu Val Leu Cys Val He Leu His Ser Arg Ser Leu 
20 25 30 

Arg Cys Arg Pro Ser Tyr His Phe He Gly Ser Leu Ala Val Ala Asp 
35 40 45 

45 Leu Leu Gly Ser Val He Phe Val Tyr Ser Phe Val Asp Phe His Val 

50 55 60 

Phe His Arg Lys Asp Ser Pro Asn Val Phe Leu Phe Lys Leu Gly Glv 

65 70 . 75 80 
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Val Thr Ala Ser Phe Thr Ala Ser Val Gly Ser Leu Phe Leu Thr Ala 
85 90 95 

lie Asp Arg Tyr lie Ser lie His Pro Pro lie Ala Tyr Lys Arg lie 
100 105 110 

5 Val Arg Arg Pro Lys Ala Val Val Ala Phe Cys Leu Met Thr He Ala 

115 120 125 

He Val He Ala Val Leu Pro Leu Leu Gly Trp Asn Cys Lys Lys Leu 
130 135 140 

Gin Ser Val Cys Cys Asp He Phe Pro Leu He Asp Gly Thr Tyr Leu 
10 145 150 155 160 

Met Phe Trp He Gly Val Thr Ser Val Leu Leu Leu Phe II j Val Tyr 

165 170 175 

Ala Tyr Met Tyr lie Leu Trp Lys Ala His Ser His Ala Val Arg Ala 
180 185 190 

15 Gin Arg Gly Thr Gin Lys Ser He He He His Thr Ser Glu Asp Gly 

195 200 205 

Lys Val Gin Val Thr Arg Pro Asp Gin Ala Arg Met Asp He Arg Leu 
210 215 220 

Ala Lys Thr Leu Val Leu He Leu Val Val Leu He lie Cys Trp Gly 
20 225 230 235 240 

Pro Leu Leu Ala lie Met Val Tyr Asp Val Phe Gly Leu Leu lie Lys 
245 250 255 

Thr Val Phe Ala Phe Cys Ser Leu Leu He Asn Ser Thr Val Asn Pro 

260 265 270 

25 He He Tyr Ala Leu Arg Ser Lys Asp Leu Arg His Ala Phe Arg Ser 

275 280 285 

Trp Pro Ser Cys Glu Gly Thr Ala Gin Pro Leu Asp Asn Ser Met Gly 
290 295 300 

Asp Ser Asp Cys Leu His Lys His Ala Asn Asn Thr Ala Se±: Met His 
30 305 310 315 320 

Arg Ala Ala Glu Ser Cys He Lys Ser Thr Val Lys Leu Ala Leu Val 
325 330 335 

Ser Thr Asp Thr Ser Ala Glu Ala Leu 
340 345 

3 5 (2) INFORMATION FOR SEQ ID NO: 71: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 349 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

4 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIP TION: SEQ ID NO: 71: 

Lvs Ala Leu Leu lie Val Ala Tyr Ser Phe Thr lie Val Phe Ser Leu 
15 10 15 

4 5 Phe Gly Asn Val Leu Val Cys His Tvr He Phe Lys Asn Gin Arg Lys 

20 25 30 
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He Ser Ala Thr Ser Leu Phe He Val Asn Leu Ala Val Ala Asp He 
35 40 45 

He Glu Thr Leu Leu Asn Thr Pro Phe Thr Leu Val Arg Phe Val Asn 
50 '55 60 

Ser Thr Trp Tyr Phe Gly Lys Gly Met Leu His Val Ser Arg Phe Ala 
65 70 75 80 

Gin Tyr Cys Ser Leu His Val Ser Ala Leu He Leu Thr Ala He Ala 
85 90 95 

Val Asp Arg His Gin Val He Met Pro Leu Lys Pro Arg He Ser He 
100 105 no 

Thr Lys Gly Val He Tyr He Ala Val He Trp Val Met Thr Phe Phe 
H5 120 125 

Ser Leu Pro His Ala He Cys Gin Lys Leu Phe Thr Phe Lys Tyr Ser 
130 135 140 

15 Glu Asp He Val Arg Ser Leu Cys Leu Asp Pro Phe Pro Glu Pro Ala 

145 150 155 160 

Asp Leu Phe Trp Lys Tyr Leu Asp He Ala Thr Phe He Leu Leu Tyr 
165 170 175 

Leu Leu Pro Leu Phe He He Ser Val Ala Tyr Ala Arg Val Ala Lys 
180 185 190 

Lys Leu Trp Leu Cys Asn Thr He Gly Asp Val Thr Thr Glu Gin Tyr 
195 200 205 

Leu Ala Leu Arg Arg Lys Lys Lys Thr Thr Val Lys Met Leu Val Leu 
210 215 220 

Val Val Val Leu Phe Ala Leu Cys Trp Phe Pro Leu Asn Cys Tyr Val 
225 230 235 240 

Leu Leu Leu Ser Ser Lys Ala He His Thr Asn Asn Ala Leu Tyr Phe 
245 250 255 

Ala Phe His Trp Phe Ala Met Ser Ser Thr Cys Tyr Asn Pro Phe He 
260 265 270 

Tyr Cys Trp Leu Asn Glu Asn Phe Arg Val Glu Leu Lys Ala Leu Leu 
275 280 285 

Ser Met Gin Pro Pro Pro Lys Pro Glu Asp Arg Leu Pro Ser Pro Val 
290 295 300 

Pro Ser Phe Arg Val Ala Trp Thr Glu Lys Ser His Gly Arg Arg Ala 
305 310 315 320 

Pro Leu Pro Asn His His Leu Pro Ser Ser Gin He Gin Ser Gly Lys 
^25 330 335 

Thr Asp Leu Ser Ser Val Glu Pro Val Val Ala Met Ser 
40 340 



20 



25 



30 



35 



345 



(2) INFORMATION FOR SEQ ID NO:72: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 301 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

lie Phe Thr lie Ala Leu Ala Tyr Gly Ala Val He He Leu Glv Val 
15 10 is' 

Ser Gly Asn Leu Ala Leu He He He He Leu Lys Gin Lys Glu Leu 
5 20 25 30 

He Leu He Val Asn Leu Ser Phe Ser Asp Leu Leu Val Ala Val Trp 
35 40 45 

Leu Pro Phe Thr Phe Val Tyr Thr Leu He Cys His Trp Val Phe Gly 
50 55 60 

10 Glu Cys Cys Lys Leu Asn Pro Phe Val Gin Cys Val Ser He Thr Val 

65 70 75 80 

Ser He Phe Ser Leu Val Leu He Ala Val Glu Arg His Gl.. Leu He 
85 90 95 

He Asn Pro Arg Gly Trp Arg Pro Asn Asn Arg His Ala Tyr He Gly 
15 100 105 110 

He Thr Val He Trp Val lie Ala Val Ala Ser Ser Leu Pro Phe Val 
115 120 125 

He Tyr Gin He Leu Thr Asp Glu Pro Phe Gin Asn Val Ser Leu Ala 
130 135 140 

2 0 Ala Phe Lys Asp Lys Tyr Val Cys Phe Asp Lys Phe Pro Ser Asp Ser 

145 150 155 160 

His Arg Leu Ser Tyr Thr Thr Leu Leu Leu Val Leu Gin Tyr Phe Gly 
165 170 175 

Pro Leu Cys Phe He Phe He Cys Tyr Phe Lys He Tyr He Arg Leu 
25 180 185 190 

Lys Arg Arg Asn Asn Met Met Lys He Arg Asp Ser Lys Tyr Arg Ser 
195 200 205 

Ser Glu Thr Lys Arg He Asn Val Met Leu Leu Ser He Val Val Ala 
210 215 220 

3 0 Phe Ala Val Cys Trp Leu Pro Leu Thr He Phe Asn He Va. Phe Asp 

225 230 235 240 

Trp Asn His Gin He He Ala Thr Cys Asn His Asn Leu Leu Phe Leu 
245 250 255 

Leu Cys His Leu Thr Leu Ser Thr Cys Val Asn Pro lie Phe Tyr Glv 
35 260 265 270 

Phe Leu Asn Lys Asn Phe Gin Arg Asp Leu Gin Phe Phe Phe Asn Phe 
275 280 285 

Cys Asp Phe Arg Ser Arg Asp Gly Arg Thr Thr Arg Leu 
290 295 300 

4 0 (2) INFORMATION FOR SEQ ID NO: 73: 

Hi gpnrTPrjrr rwiiP^rrrnTrTTrr. 

(A) LENGTH: 334 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : smqle 
4 5 ( d J TOPOLOGY : 1 inear 

iiii MOLECULE TYPE: peptide 
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ixi) SEQUENCE DESCRIPTION : SEQ ID NO:73: 

Leu Thr Ser Val Val Phe He Leu He Cys Cys Phe He He Leu Glu 

1 5 10 15 

Asn lie Phe Val Leu Leu Thr lie Trp Lys Thr Lys Lys Phe His Arg 
20. 25 30 

Pro Met Tyr Tyr Phe He Gly Asn He Ala Leu Ser Asp Leu Ile Ala 
35 40 45 

Gly Val Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tvr 
50 55 60 

Lys Leu Thr Pro Ala Gin Trp Phe Leu Arg Glu Gly Ser Met Phe Val 
65 70 75 80 

Ala Leu Ser Leu Ser Val Phe Ser Leu Leu Ala He Ala He Glu Arg 
85 90 95 

. c T V r Ile Thr Met Leu Lys Met Leu His Asn Gly Ser Asn Asn Phe Arg 

lb 100 105 no 

Leu Phe Leu Leu Ile Ser Ala Cys Trp Val He Ser Leu Ile Leu Gly 
H5 120 125 

Gly Leu Pro Ile Met Gly Trp Asn Cys Ile Ser Ala Leu Ser Ser Cys 
130 135 140 

20 Ser Thr Val Leu Pro Leu Tyr His Lys His Tyr He Leu Phe Cys Thr 

145 150 155 160 

Leu Ile Val Phe Thr Leu Leu Leu Leu Ser He Val He Leu Tyr Cys 
165 170 175 

oc ^9 Ile Ty* Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arq 

^ b 180 185 190 

Lys Asn Ile Ser Lys Ala Ser Arg Ser Ser Glu Asn Val Ala Leu Leu 
195 200 205 

Lys Thr Val He Ile Val Leu Ser Val Phe He Ala Cys Trp Ala Pro 
210 215 220 

3 0 Leu Phe Ile Leu Leu Leu Leu Asp Val Gly Cys Lys Val Lys Thr Cys 

225 230 235 240. 

Asp Ile Leu Phe Arg Ala Glu Tyr Phe Leu Val Ile Ala Val Ile Asn 
245 250 255 

Ser Gly Thr Asn Pro Ile Ile Tyr Thr Leu Thr Asn Lys Glu Met Arq 

i5 260 265 270 

Arg Ala Phe Ile Arg He Met Cys Cys Lys Cys Pro Ser Gly Asp Ser 
27 5 280 285 

Ala Gly Lys Phe Lys Arg Pro He Ile Ala Gly Met Glu Phe Ser Ara 
290 295 300 

4 0 ser Lys Ser Asp Asn Ser Ser His Pro Gin Lys Asp Glu Gly Asp Asn 

305 310 315 320 

Pro Glu Thr Ile Met Ser Ser Gly Asn Val Asn Ser Ser Ser 
325 330 

(2) INFORMATION FOR SEQ ID NO: 74: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 
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(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

He Thr Tyr Tyr He Leu He Gly Leu Cys Ala Val Val Gly Asn He 

1 5 10 15 

Leu Leu Val He Trp Val Val Lys Leu Asn Arg Thr Leu Arg Thr Thr 

20 25 30 

10 Thr Phe Tyr Phe lie Val Ser He Ala Leu Ala Asp He Ala Val Leu 

35 40 45 

Val He Pro Leu Ala He Ala Ser Ala Trp Arg Ser Arg Cys Thr Ser 
50 55 60 

Asn Cys Leu Phe Met Ser Cys Val Leu Leu Val Phe Thr His Ala Ser 
15 65 70 75 80 

lie Met Ser Leu Leu Ala He Ala Val Asp Arg Tyr Leu Arg Val Lys 
85 90 95 

Leu Thr Val Arg Tyr Arg Thr Val Thr Thr Gin Arg Arg lie Trp Leu 
100 105 110 

2 0 Phe Leu Gly Leu Cys Trp Leu Val Ser Phe Leu Val Gly Lea Thr Pro 

115 120 125 

Trp Gly Trp Asn Arg Lys Val Thr Leu Glu Leu Ser Gin Asn Ser Ser 
130 135 140 

Thr Leu Arg Glu Phe Lys Thr Pro Lys Ser Leu Phe Leu Val Leu Phe 
25 145 150 155 160 

Leu Phe Ala Leu Cys Trp Leu Pro Leu Ser lie lie Asn Phe Val Ser 
165 170 175 

Tyr Phe Asn Val Lys He Pro Glu Thr Leu Leu Gly lie Leu Leu Ser 
180 185 190 

3 0 His Ala Asn Ser Leu Pro lie Val Tyr Ala Cys Lys Lys Lys Phe Lys 

195 200 205 

Glu Thr Tyr Phe Val He Leu Arg Ala Cys Arg Leu Cys Gin Thr Ser 
210 215 220 

Asp Ser Leu Asp Ser Asn Leu Glu Gin Thr Thr Glu 

35 225 230 235 

(2) INFORMATION FOR SEQ ID NO: 75: 
(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 322 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala lie Leu H p s«*r Ph«* tu tyr- cnr T^-p n..n Tr , u w dl ^-m . L( u u 
1 5 IC 15 



40 



TT5- 



Gly Asn Ser Met Val He Tyr Val lie Leu Arg Tyr Ala Ly- Met Lys 

20 25 30 

Thr Ala Thr Asn He Tyr lie Leu Asn lie Ala lie Ala Asp Glu Leu 



WO 94/05695 



PCI7US93/08528 



10 



- 130 - 

35 40 45 

Leu Val Pro Phe Leu Val Thr Ser Thr Leu Leu Arg His Trp Pro Phe 
50 55 60 

Gly Ala Leu Leu Cys Arg Leu Val Leu Ser Val Asp Ala Val Asn Met 

65 70 75 80 

Phe Thr Ser lie Tyr Cys Leu Thr Val Leu Ser Val Asp Arg Tyr Val 
85 90 95 

Ala Val Val His Pro lie Lys Ala Ala Arg Tyr Arg Arg Pro Thr Val 
100 105 no 

Ala Lys Val Val Asn Leu Gly Val Trp Val Leu Ser Leu Leu Val He 
115 120 125 

Leu Pro He Trp Phe Ser Arg Thr Ala Ala Asn Ser Asp Gly Thr Val 
130 135 140 

Ala Cys Asn Met He Trp Glu Pro Ala Gin Phe Trp Leu Va* Gly Phe 
15 14 5 150 155 160 

Val Leu Tyr Thr Phe Leu Met Phe Leu Leu Pro Val Gly Ala He Cys 
165 170 175 

Leu Cys Tyr Val Leu He He Ala Lys Met Arg Met Val Ala Leu Lys 
180 185 190 

20 Ala Gly Trp Gin Gin Arg Lys Arg Ser Glu Arg Lys He Thr Leu Val 

195 200 205 

Met Met Val Val Met Val Phe Val He Cys Trp Phe Tyr Val Val Gin 
210 215 220 

nc Leu Val v al Phe Ala Glu Gin Asp Asp Ala Thr Val Ser Gin Leu 

25 225 230 235 240 

Ser Val He Leu Gly Tyr Ala Asn Ser Cys Ala Asn Pro He Leu Tyr 
245 250 255 

Gly Phe Leu Ser Asp Asn Phe Lys Arg Ser Phe Gin Arg He Leu Cys 
260 265 270- 

Leu Ser Leu Asn Ala Ala Glu Glu Pro Val Asp Tyr Tyr Ala Thr Ala 
275 280 285 

Leu Lys Ser Arg Ala Tyr Ser Val Glu Asp Phe Gin Pro Glu Asn Leu 
290 295 300 

■a- Glu Ser G1 y G1 Y Val Phe Arg Asn Cys Thr Cvs Ala Ser Arg He S*r 

iD 305 310 315 320 

Thr Leu 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptid 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

Val Thr Asn Tyr He Phe Leu Leu Leu Cys Leu Cys Gly Leu Val Glv 
1 5 ■ 10 15 
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Asn Gly Leu Val Leu Trp Phe Phe Gly Phe Ser lie Lvs Arg Thr Pro 
20 25 " 30 

Phe Ser He Tyr He Tyr Phe Leu His He Ala Ser Ala Asp Gly lie 
35 40 45 

5 Tyr Leu Phe Ser Lys Ala Val He Ala Leu Leu Asn Met Gly Thr Phe 

50 55 60 

Leu Gly Ser Phe Pro Asp Tyr Val Arg Arg Val Ser Arg He Val Gly 
65 70 75 80 

Leu Thr Phe Phe Ala Gly Val Ser Leu Leu Pro Ala lie Ser He Glu 

10 85 90 . 95 

Arg Cys Val Ser Val He Phe Pro Met Trp Tyr Trp Arg Arg Arg Pro 
100 105 110 

Lys Arg Leu Ser Ala Gly Val Cys Ala Leu Leu Trp Leu Leu Ser Phe 
115 120 125 

15 Leu Val Thr Ser lie His Asn Tyr Phe Cys Leu Leu Gly His Glu Ala 

130 135 140 

Ser Gly Thr Ala Cys Leu Asn Met Asp lie Ser Leu Leu Glv lie Leu 
14 5 150 155 - 160 

Leu Phe Phe Leu Phe Cys Pro lie Met Val Leu Pro Cys lie Ala Leu 
20 165 170 175 

Leu His Val Glu Cys Arg Ala Arg Arg Arg Gin Arg Ser Ale Lys Leu 
180 185 190 

Asn His Val Val Leu Ala lie Val Ser Val Phe Leu Val Ser Ser lie 
195 200 205 

2 5 Tyr Leu Gly lie Asp Trp Phe Leu Phe Tro Val Phe Gin lie Pro Ala 

210 215 220 

Pro Phe Pro Glu Tyr Val Arg Asp Leu Cys lie Cys He Asn Ser Ser 
225 230 235 240 

Ala L ys Pro lie Val Tyr Phe He Ala Gly Arg Asp Lys Ser Gin Arg 
30 245 250 255 

Leu Trp Glu Pro Leu Arg Val Val Phe Gin Arg Ala Leu Arg Asp Gly ' 
260 265 270 

Ala Glu Pro Gly Asp Ala Ala Ser Ser Thr Pro Asn Thr Val Thr Met 
275 280 285 

3 5 Glu Met Gin Cys Pro Ser Gly Asn Ala Ser 

290 295 

(2) INFORMATION FOR SEQ ID NO: 77: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 299 amino acids 
40 (B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLE rTTT.P TVPF- p^pi- , 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Thr Glu Ala Val Leu Asn Thr Phe lie lie Phe Val Glv Glv Pro 

1 E 10 * 15 



Ala lie Val Leu He Thr Gin Leu Leu Thr Asn Arg Val Leu Gly Ty 
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20 25 30 

Ser Thr Pro Thr lie Tyr Met Arg Asn Leu Tyr Ser Thr Asn Phe Leu 
. 35 40 45 

Thr Leu Thr Val Leu Pro Phe lie Val Leu Ser Asn Gin Trp Leu Leu 
5 50 55 €0 

Pro Ala Cys Tyr Val Ala Ser Cys Lys Phe Leu Ser Val lie Tyr Tyr 
65 70 * 75 80 

Ser Ser Cys Thr Val Gly Phe Ala Thr Val Ala Leu lie Ala Ala Asp 
85 90 95 

10 Arg Tyr Arg Val Leu Hie Lys Arg Thr Tyr Ala Arg Gin Ser Tyr Arg 

100 105 110 

Ser Leu Leu Leu Thr Trp Leu Ala Gly Leu lie Phe Ser Val Pro Ala 
115 120 125 

Ala Val Tyr Thr Thr Val Val Met His His Asp Ala Asn Asp Thr Asn 
15 130 135 140 

Asn Thr Asn Gly His Ala Thr Cys Val Leu Tyr Phe Val Ala Glu Glu 
145 ISO 155 160 

Val His Thr Val Leu Leu Ser Trp Lys Val Leu Leu Thr Met Val Trp 
165 170 175 

2 0 Gly Ala Ala Pro. Val lie Leu Phe Tyr Ala Phe Phe Tyr Se~ Thr Val 

180 185 190 

Gin Arg Thr Ser Gin Lys Gin Arg Ser Arg Thr Leu Thr Phe Val Ser 
195 200 205 

Val Leu Leu lie Ser Phe Val Ala Leu Gin Thr Pro Tyr Val Ser Leu 
25 210 215 220 

Met lie Phe Asn Ser Tyr Ala Thr Thr Ala Trp Pro Met Cys Glu His 
225 230 235 240 

Leu Thr Leu Arg Arg Thr lie Gly Thr Leu Ala Arg Val Val Pro His 
245 250 255 

3 0 Leu His Cys Leu lie Asn Pro lie Leu Tyr Ala Leu Leu Cys His Asp 

260 265 270 

Phe Leu Gin Arg Met Arg Gin Cys Phe Arg Gly Gin Leu lie Asp Arg 
275 280 285 

Ala Phe Leu Arg Ser Gin Gin Asn Gin Arg Ala 
35 290 295 

(2) INFORMATION FOR SEQ ID NO: 78: 
d) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

4 0 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 78: 

Leu Gly Val Trp Leu Met lie Val Gly Thr Phe Leu Leu Val lie Thr 
45 l s 10 is 

Thr He Leu Tyr Tyr Arg Arg Lys Lvs Lys Ser Pro Ser Asn Thr Tyr 
20 25 30' 
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lie Cys Asn Leu Ala Val Ala Asp Leu Leu lie Val Val Gly Leu Pro 
35 40 45 

Phe Phe Leu Glu Tyr Ala Lys His His Pro Lys Leu Ser Arg Glu Val 
50 55 60 

5 Val Cys Ser Gly Leu Asn Ala Cys Phe Tyr lie Cys Leu Phe Ala Gly 

65 70 ^5 80* 

Val Cys Phe Leu lie Asn Leu Ser Met Asp Arg Tyr Cys Val lie Val 
85 90 95 

Trp Gly Val Glu Leu Asn Arg Val Arg Asn Asn Lys Arg Ala Thr Cys 
10 100 105 110 

Trp Val Val He Phe Trp He He Ala Val Leu Met Glv Met Pro His 
115 120 125 

Tyr He Met Tyr Ser His Thr Asn Asn Glu Cys Val Gly Trp Phe Ala 
130 135 140 

15 Asn Glu Thr Ser Cys Trp Phe Pro Val Phe Leu Asn Thr Ly.- Val Asn 

145 150 155 160 

He Cys Gly Tyr Leu Ala Pro lie Ala Leu Met Ala Tyr Tyr Asn Arg 
165 170 175 

Met Val Arg Phe He He Asn Tyr Val Gly Lys Trp Phe Met Gin Thr 
20 iao 185 190 

Leu His Val Leu Leu Val Val Val Val Ser Phe Ala Ser Phe Trp Phe 
195 ' 200 205 

Pro Phe Asn Leu Ala Leu Phe Leu Glu Ser He Arg Leu He Ala Gly 
210 215 220 

2 5 Val Tyr Asn Asp Thr Leu Gin Asn Val He He Phe Cys Leu Tyr Val 

225 230 235 240 

Gly Gin Phe He Ala Tyr Val Arg Ala Cys Leu Asn Pro Gly He Tyr 
245 250 255 

He. Leu Val Cys Thr Trp Phe Leu Arg Val Phe Ala Cys Cys Cys Val 
30 260 265 270 

Lys Gin Glu He Pro Tyr Gin Asp He Asp He 
275 280 

(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 295 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Pro Val Thr Leu Phe Leu Tyr Gly Val Val Phe Leu Phe Gly Ser He 
5 10 15 



Gly Aon Phe Leu Val He Phe T hr He 1 ' hr " Trp Arg Arg Arg He Gin 

20 25 30 

4 5 Cys Ser Gly Asp Val Tyr Phe He Asn Leu Ala Ala Ala Asp Leu Leu 

35 40 45 

Phe Val Cys Thr Leu Pro Leu Trp Met Gin Tyr Leu Leu Asc His Asn 
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50 55 60 

Ser Leu Ala Ser Leu lie Pro Cys Thr Leu Leu Thr Ala Cys Phe Tvr 
65 70 75 80 

Val Ala He Thr Ala Ser Leu Cys Phe He Thr Glu He Ala Leu He 
5 85 90 95 

Asp Arg Tyr Tyr Ala He Val Tyr Met Arg Tyr Arg Pro Val Lys He 
100 105 110 

Gin Ala Cys Leu Phe Ser He Phe Trp Trp He Phe Ala Val He lie 
115 120 125 

10 Ala He Pro His Phe Met Val Val He Thr Lys Lys Asp Asn Gin Cys 

130 135 140 

Met Thr Asp Tyr Asp Tyr Leu Glu Val Ser Tyr Pro He He Leu Asn 
145 150 155 160 

Val Glu Leu Met Leu Gly Ala Phe Val He Pro Leu Ser Val He Ser 
15 165 170 175 

Tyr Cys Tyr Tyr Arg He Ser Arg He Val Ala Val Ser Gin Ser Arg 
180 185 190 

His Lys Gly Arg He Val Arg Val Leu He Ala Trp Leu Val Phe He 
195 200 205 

20 He Phe Trp Leu Pro Tyr His Leu Thr Leu Phe Val Asp Thr He He 

210 215 220 

Lys Leu Leu Lys Trp He Ser Ser Ser Cys Glu Phe Glu Arg Ser Leu 
225 230 235 240 

Lys Arg Ala Leu He Leu Thr Glu Ser Leu Ala Phe Cys His Cys Cys 
25 245 250 255 

Leu Asn Pro Leu Leu Tyr Val Phe Val He Gly Thr Lys Phe Arg Lys 
260 265 270 

Asn Tyr Thr Val Cys Trp Pro Ser Phe Ala Ser Asp Ser Phe Pro Ala 
275 280 285 

3 0 Met Tyr Pro Gly Thr Arg Ala 

290 295 

(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



40 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 80: 

Asp Asp Asp Asp Asn He Trp Ser He Phe Asp Trp He Gly Tvr Leu 
15 10 15 

Asn Ser He Ser Met Val He Tyr Thr Leu Phe Lys Lys Lys Lys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 81: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Asp Asp Asp Asp Asn lie Trp Asn lie Phe Ser Thr lie Gly Tyr Leu 
5 15 10 15 

Asn Ser lie Ser Pro Val Ser Val lie Met His lie Tyr Gly Lys Lys 
20 25 30 

Lys Lys 

10 (2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Asp Asp Asp Asp Gly Tyr Ser lie Tyr Asp Thr Leu Val Thr Phe Ala 

15 10 15 

20 lie Asn Pro Val Tyr He Thr Val Phe Lys Lys Lys Lys 

20 25 

(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: 
3 0 Asp Asp Asp Asp Asn Ala Trp Ser Ala Phe Asp Trp Ala Leu Tyr Leu 

15 10 15 

Asn Ser He Ser Met Ala He Tyr Thr Tyr Ala Lys Lys Lys Lys 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 84: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

4 0 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Leu Phe Ser Phe He Thr Trp Leu Gly Tyr Ala Asn Ser Ser Leu Asn 

15 10 15 

Pro He lie Tyr Thr Thr Phe 
45 20 

(2) INFORMATION FOR SEQ ID NO: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

-(ii) MOLECULE TYPE. p^JLiUfci 



(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Tyr Thr He Tyr Ser Ser Ser Val Val Phe Phe Ala Pro Ser Leu Ala 
55 1 5 10 15 



He Met Val He Thr Tyr Thr 

20 
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(2) INFORMATION FOR SEQ ID NO: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE : amino acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

lie Trp Leu Thr Ser Asp lie Met Ser Thr Ser Ser lie Leu His Asn 
10 1 5 10 15 

Leu Cys Val lie Ser Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

He Trp Ser He Phe Ser Ser Asp He Val Val Gly Tyr Ala Asn His 
15 10 15 

Ser Ser Leu Ala He Met Cys Pro He Val He Tyr Thr Va.-. 

20 25 30 

25 (2) INFORMATION FOR SEQ ID NO: 88: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

He Phe Thr He Phe Ser Ser Asp He Ala Val Gly Tyr Ala Asn His 
1 5 10 15 

35 Ser Ser Ala Ala He Met Pro He Val He Tyr Ser Val 

20 25 

(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 ■ amino acids 
40 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
45 Lys Asn Ala Ser Ala Leu Leu Ser Val He He He Asn Ser He Gly 

1 5 10 15 

Gly Asn Val Val Thr Ala Val Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 
5 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
5 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Tyr Phe Leu Met Ser Leu Ala Val Thr Asp Leu Val Val Ser Phe Val 
1 5 10 15 
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Met Pro Val Ser Ala Leu 

20 

(2) INFORMATION FOR SEQ ID NO: 91: 
(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

10 ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala He Thr Lys He Ala He Thr Trp Ala He Ser Gly Val Ser Val 
15 10 15 

Pro Phe He Pro Val Trp Gly 
20 

15 (2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino arid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pepzide 

(xi) SEQUENCE DESCRIPT:. DN : SEQ ID NO:92: 

Leu Gly He He Phe G:.v Thr Phe He He He Trp Leu Pr~ Phe Phe 

1 5 10 15 

25 He Thr Asn Leu Val . f -ir Pro He 

20 

(2) INFORMATION FOR SEQ I NO: 93: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 
3 0 (B) TYPE: amin., acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: oeptide 

(xi) SEQUENCE DESCRZPTION : SEQ ID NO:93: 

3 5 lie Trp He Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ser He Met 

15 10 15 

His Leu Cys Ala He Ser Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 94: 

4 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
4 5 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Gly Tyr Thr He Tyr Ser Thr Leu Val Thr Phe Tyr lie Pro Ser Val 

1 ? 10 15 

He Met Val lie Thr Tyr Gly 
50 20 



(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHAKAl ' lkKlbTlLS: 



(A) LENGTH: 2 3 amino acids 

(B) TYPE: amino acid 

5 5 (C) STRANDEDNESS: single 

{ D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
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Leu Leu Asn Phe Phe Asn Trp lie Gly Tyr Leu Asn Ser Leu lie Asn 
1 5 10 15 

Pro Val lie Tyr Thr Leu Phe 
20 
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22. The method of claim 21, wherein said psychotic 
disorder is a schizophrenia. 

23. The method of claim 20, wherein said composition is 
administered to provide said polypeptide, fragment or consensus 

5 peptide thereof, in an amount ranging from about 0.01 fig to 100 mg/kg 
per day. 

24. The method of claim 23, wherein said composition is 
administered to provide said polypeptide, fragment or consensus 
peptide thereof, in an amount ranging from about lOjxg to 10 mg/kg per 

10 day. 

25. The method of claim 20, wherein said administering is 
by oral, mucosal, intravenous, intramuscular or parenteral 
administration . 

26. A method for producing a polypeptide according to 
15 claim l, wherein said polypeptide is a recombinant polypeptide 

obtained from a recombinant host which expresses a heterologous 
nucleic acid encoding said polypeptide, comprising the steps of: 

(A) providing a host comprising a recombinant nucleic 
acid encoding a polypeptide according to claim 1 in 

20 expressible form; 

(B) culturing said host under conditions such that 
said polypeptide is expressed in recoverable amounts; and 

(C) recovering said polypeptide produced by said 

host . 

25 27. The method of claim 26, further comprising: 

<D) purifying said polypeptide. 

28. The method of claim 26, wherein said host is a 
bacteria or a eukaryotic cell. 

29. The method of claim 28, wherein said eukaryotic cell 
3° is a mammalian cell, an insect cell or a yeast cell. 

30. A method for producing a polypeptide according to 
claim 1, comprising: 

. (A) chemically synthesizing a polypeptide according 
to claim l in recoverable amounts; and 
35 (B) recovering said polypeptide. 
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31- A method for isolating a G-protein coupled receptor, 
fragment or consensus sequence thereof, or a protein that binds the 
G-protein coupled receptor, comprising 

(AO providing a bound support, said support being 
5 bound to a polypeptide according to claim i, or an 

antibody, anti - idiotype antibody, or a fragment thereof; 

(B) contacting a sample containing said G-protein 
coupled receptor or said protein that binds a G-protein 
coupled receptor to said bound support, such that said 

10 receptor or protein is reversibly bound to said bound 

support ; and 

(C) recovering said receptor or protein that is 
attached to the bound support by dissociating the receptor 
or protein under conditions that cause elution or 

15 dissociation of the receptor or protein from said bound 

support . 

32. A method according to claim 31, wherein said GPR is 
a dopamine receptor. 

33. An antibody, anti- idiotype antibody or a fragment of 
20 said antibody or anti- idiotype antibody, that specifically displays 

an epitope of a G-protein coupled receptor polypeptide, according to 
claim 1. 

34. A recombinant nucleic acid comprising a nucleotide 
sequence encoding a G-protein coupled receptor polypeptide according 

25 to claim 1. 

35. A vector comprising a nucleic acid according to claim 

34. 

36. A host cell comprising the nucleic .icid of claim 34. 

37. A host cell according to claim 36, wherein said host 
30 cell is selected from a mammalian cell, a yeast cell, a bird cell or 

an insect cell. 

38. A host cell according to claim 36, wherein, when said 
nucleic acid is expressed as said receptor polypeptide in said host 
cell, a receptor binding molecule comprising said env binding domain 

3 5 binds to said receptor polypeptide. 
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WHAT IS CLAIMED IS: 

1. A G-protein coupled receptor polypeptide, consisting 
essentially of an amino acid sequence of 15 to 40 amino acids 
substantially corresponding to a fragment or consensus peptide of a 

5 transmembrane domain of a G-protein coupled receptor, wherein said 
polypeptide has a GPR- related biological activity selected from 
binding a GPR ligand or modulating GPR ligand binding to a GPR. 

2. A polypeptide according to claim 1, wherein said 
polypeptide is selected from a synthetic polypeptide, a recombinant 

10 polypeptide or a purified polypeptide. 

3. A polypeptide according to claim 1, wherein said G- 
protein coupled receptor is a receptor selected from a cAMP receptor, 
an adenosine receptor, a 0-adrenergic receptor, a muscarinic 
acetylcholine receptcr, an a-adrenergic receptor, a serotonin 

15 receptor, a histamine H2 receptor, a thrombin receptor, a kinin 
receptor, a follicle stimulating hormone receptor, an opsin, a 
rhodopsin, an odorant receptor, a cytomegalovirus receptor, or a mas 
oncogene GPR. 

4. A polypeptide according to claim 1, wherein said 
20 transmembrane domain is selected from at least one of transmembrane 

domain TM1 , TM2 , TM3 , TM4 , TM5 # TM6 or TM7 . 

5 . A polypeptide according to claim 3 , wherein said 
transmembrane domain is a D : receptor transmembrane segment III or 
segment V. 

25 6. A polypeptide according to claim 4, wherein said 

polypeptide has the amino acid sequence of Fig. 2 (SEQ ID NO: 2 ) . 

7. A polypeptide according to claim 4, wherein said 
polypeptide has the amino acid sequence of Fig. 3 (SEQ ID N0:3). 

8. A polypeptide according to claim 4, wherein said 
3 0 polypeptide has an amino acid sequence selected from one of SEQ ID 

NOS:80-95. 

9 . A polypeptide according to claim 4 , wherein said 
polypeptide has an amino acid sequence of one of SEQ ID NOS:96-348. 

10. A polypeptide according to claim 9 whprpin — said 

35 polypeptide has an amino acid sequence from one of SEQ ID NOS:96-225. 
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11. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS-.226- 
289. 

12. A polypeptide according to claim 9, wherein said 
5 polypeptide has an amino acid sequence from one of SEQ ID NOS:290- 

297. 

13. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:298- 
324. 

10 14. A polypeptide according to claim 9, wherein said 

polypeptide has an amino acid sequence from one of SEQ ID NOS:32 5- 
338. 

15. A polypeptide according to claim 9, wherein said 
polypeptide has an amino acid sequence from one of SEQ ID NOS:339- 

15 348. 

16. A polypeptide according :o claim 3, wherein said 
transmembrane domain is a dopamine receptor transmembrane domain 
selected from the group consisting of a D,, D 2 D 3# D 4 or D 3 
transmembrane domain. 

20 17. a composition comprising a polypeptide according to 

claim 1, or a pharmaceutically acceptable ester, ether, sulfate, 
carbonate, glucuronide or salt thereof, and a pharmaceutically 
acceptable carrier. 

18. A composition according to claim 16, wherein said 
25 transmembrane domain is D 2 receptor transmembrane segment III or 

segment V. 

19. A composition according to claim 18, further 
comprising a drug selected from a phenothiazine derivative, a 
thioxanthine derivative, a butyrophenone derivative, a 

30 dihydroindolone, a dibenzoxazepine derivative and an atypical 
neuroleptic . 

20. A method for treating a subject suffering from a 
pathology related to an abnormality of a G-protein coupled receptor, 
comprising administering to said subject a therapeutically effective 

35 amount of composition according to claim 16. 

21. The method of claim 20, wherein said pathology is a 
psychotic disorder. 
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39. A host cell according to claim 37, wherein said hose 



cell is a mammalian cell selected from a human cell, a primate cell 
or a rodent cell . 

40. A • method for isolating a protein that binds a 
G-protein coupled receptor, comprising 



(A) providing a bound support, said support being 
bound to a polypeptide according to claim l, or 
anti- idiotype antibody thereto, - 

(B) contacting a sample containing said protein that 
binds a G-protein coupled receptor to said bound support, 
such that said protein is reversibly bound to said bound 
support ; and 

(C) recovering said protein that is attached to the 
bound support by dissociating the receptor or protein under 
conditions that cause elution or dissociation of the 
protein from said bound support. 

41. A method according to claim 40, wherein said GPR is 



a dopamine receptor. 
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